FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGT CTGAAGAC TAACAT T T T GT GAAGT T GTAAAACAGAAAACC TG T T AGAA ATG TGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT TACTGCAGTAACAC TCCACCATATAGACCCGGCT T TAC C TTATATCAGTGACACT GG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
C TAC CAT T TATGT TCGT TATAAGCAAGT TCAT GCTC TGAGTCC TGAAGAGAACGTTAT CAT CAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATAT GTT T GT TCAGACCATCC T T TCCT ACCAAATGCAGCCCAAAATCCATGGCAAACAAG TC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGT T T T GCACAGTGGCAAT T T T GGGACT GAT T TAGAACAGAAACTCCAT TGGAACCCCGAGG 
ACAAAGGTTAT GT GC T T CACATGATCAC TACTGCAGCAGAATGGTC TAT GTCAT TTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCC TC TAT GACAC TGCACC T T GCCC TAT T AACAAT GAACGAACACGGC TACT T TCCA 
GAGATAT TTGATGAAAGGATAAAATAT T T CT GTAATGAT TAT GAT T C TCAGGGAT TGGGGAAAGG 
T TCACAGAAG TTGC T TAT TCT TCTC TGAAAT T T TCAAC CACT TAATCAAGGC TGACAGTAACAC T 
GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCAT TTGATAGAT TATTCTAAAGGATATCAT 
CAAGAAGAC TAT TAAAAACACC TATGCCTATAC T T T TT TAT CTCAGAAAATAA&GT CAAAAGACT 
ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 

LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 

MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 

NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE 3 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTT CGCCATGAGT TT CCT CATCGACT CGAGCATCATGAT TACC TCC CAGATACTATTT TTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
CATAAGCAAAAAGAAAAGGAT GGCAAT GGCACGGAGAACAATGT TCCAGAAGGGGGAAGT GCATAACA 
AACCATCAGGTT TCT GGGGAATGATAAAAAGT GTTACCACTTCAGCATCAGGAAGTGAAAATCT TACT 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTT 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAAT GGCACCT TGAACTTAAGCCTACTACAGACT GT TAGAGGCCAGTGGT TTCAAAATTTA 
GATATAAGAGGGGGGAAAAAT GGAACCAGGGCCT GACATT TTATAAACAAACAAAAT GCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAAC TCAAGACAATACTCAGCAGAGAGCAT CCCGTGTGGATAT GAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTAT GGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGT TCACATGGAAAAGGTTATAGCTTTG 
CCTT GAGATTGACTCATTAAAATCAGAGACT GTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGC CGC G 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE ± 

MS FL I DSS IMI TS QI L FFGFGWLFFMRQLFKDYE IRQYWQVI FS VT FAFS CTMFEL 1 1 FE I LGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTIMMiLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGM 

LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGI QFDVKFWSQHI S FILVG 1 1 1 VTS IRGLL I TLTKFFYAI SSSKSSNVI VLLLAQIMGMY 
FVSSVLLIRMSMPLEYRT 1 1 TEVLGELQFNFYHRWFDVI FLVSALSS ILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
AT TATCACCACCAAAGAT CCCATAT TCAACACT CAAACT GCAACACAAACAACAGAAT T TATT GT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
GAAACT AGCACCAT GTCTACAGAAAC T GAACCATT TGT T GAAAATAAAGCAGCAT TCAAGAAT GA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
TGAGGAATCAAAGAAAACTGATAAAAACCCAGAAGAGT CCAAGAGT CCAAGCAAAAC TACCGT GC 
GATGCC TGGAAGC TGAAGT T TAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTT T CT T T 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCT TGGT T CCTAAC T GGAAT CAGC TCAGGACTGCCAT T GGAC TAT GGAGT GCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GC T C T G AAAG AGAAAC AC G TAT CC C AC C T G AC AT GTCCTTCT GAGC C C G G T AAGAGC AAAAG AAT 
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
GCTAAAATAAAGAAAT AGAACAAGGC T GAGGATACGACAGTACAC TGTCAGCAGGGACTGTAAAC 
AC AG AC AG GG T C AAAG TGTTTTCTCT GAAC ACAT T GAG T T G G AAT C AC T G T T T AGAAC ACACAC A 
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAATAAAAAC TC T TATAAAT T TC TAT TT TT ATCTGAGT T ACAGAAATGAT TAC TAAGGAAGAT T 
ACT CAGTAAT T TG T T TAAAAAGTAATAAAAT T CAACAAACAT T TGC TGAATAGC TAC TATATGT C 
AAGTGCTGTGCAAGGTATTACACTC TGTAAT TGAATAT TATTCCTCAAAAAAT TGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TTTTAT TTTT GCTGAGACTAATCTTAT TCAT TTTCTCTAATATGGCAACCATTATAACCT TAATT 
TATTATTAACATACCTAAGAAGTACATTGTTACCTCTATATACCAAAGCACATTTTAAAAGTGCC 
AT TAACAAAT GTATCAC TAGCCCTCCT T T TT CCAACAAGAAGGGAC TGAGAGATGCAGAAATAT T 
TGTGACAAAAAAT TAAAGCAT T TAGAAAACT T 



APP ID=1 0063567 



Page 157 



FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSIAGKDQVETALKASFETCSYGWVGDGFVVISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I CVTEVFMET STMS TE TE PFVENKAAFKNEAAG FGGVPTALLVLALL FFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT TGCGCAGCGCGGTGGAAGAGAT GGAGGCAGAAGAAGC TGC TGCTA 
AAGCAT CAT CAGAAGTGAACC TGGCAAAC TTACCT CCCAGCTATCACAAT GAGACCAACACAGAC 
ACGAAGGT T GGAAATAATACCATCCAT GT GCACCGAGAAATT CACAAGAT AACCAACAACCAGAC 
TGGACAAAT GGTC T T TT CAGAGACAGT TATCACATC TGT GGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
TTTGTTCTACATGGCTTTGATAAT TGT TT GAGGGGAGGAGAT GGAAACAATGTGGAGTCTCCCTC 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTT TCCACGCAGT TCT TT CCATGGGCATAGGTAAGCTGTGCCT TCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
T TT TC TT GAGGCAT GCACAT C TGGAAT TAAGGTCAAAC TAAT TCTCACAT CC CT CTAAAAGTAAA 
C T AC TGT TAG GAAC AG C AG T G T T C T C AC AG TGT GG GG C AG CCGTCCTTC T AAT GAAG AC AAT GAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT TAACCTGCAGAAACAGTACT TAGGTAATTGTAGGGCGAGGATTATAAATGAAAT TTGC 
AAAATCACT TAGCAGCAACTGAAGACAAT TATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GAT T GT AT AAGCAT GCT T TC T T TGAG T T T TAAAT T AT GT ATAAAC AT AAGT T GCAT T T AGAA 
AT CAAGCAT AAATCACT TCAAC T GCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEVNLANLPPSYHNETNTDTKVGNNTIHVHREI 

TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEE I 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 14 0-144 
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FIGURE Q 



CGGA CGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATAT T GACCACAAGCT CAGAGGAGTGCAGAACCCT GTAGCCCGCT GCATCAT GTGCTGTTT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGAT TCT GGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCT CCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGAT T T TAT TAAAGAT AT T T TGT TAACT C 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVIAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 

gccgggacccccgggctgccatccagaatgggttttggttctttaagttcctgatcctggtgggcctc 
h accgtgggtgccttctacatccctgacggctccttcaccaacatctggttctacttcggcgtcgtggg 
y ctccttcctcttcatcctcatccagctggtgctgctcatcgactttgcgcactcctggaaccagcggt 

ggctgggcaaggccgaggagtgcgattcccgtgcctggtacgcaggcctcttcttcttcactctcctc 
71J ttctacttgctgtcgatcgcggccgtggcgctgatgttcatgtactacactgagcccagcggctgcca 

IB CGAGGGCAAGGTCTTCATC^GCCTCAACCTCACCTOT 

|!1 CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
'H TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
1. GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
J« TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
||| GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
Q ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
fll ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNS TVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEE GAG I PT VLQGH I DCGS LLGYRAVYRMCFATAAFFFFFFTLLMLCVS S SRDPRAAI Q 
NGFWFFKFLI LVGLTVGAFYI PDGS FTNIWFYFGWGS FLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLS IAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
S FFHFCLVLASLHVMMTLTNWYKPGETRKMI S TOTAVWVKI CASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 



APP ID=10063567 



Page 164 



FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGAC TCCT GTAACCC TCC T CCAGGATGAACCACCTGCCAGAAGACAT GGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGAT TGAGTCC TATGAAGGAAGGGAAAAGAAAGGCATAT CT GAT GTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACAT TAGAGAAGGAGGTGATGCAGTATGACTACTAT TCTTCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCC T GGAT TGAGACGTGGT TCC T GGAT T TCAAAGT GT TACCT CAAGAAGCAGAAGAAGAAAA 
CAGACT CCTGATAGT T CAGGATGC T TCAGAGAGGGCAGCACTTATACCTGGTGGTC T TT CT GATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACTT TTAGAACT ATGAGTACTAC T T TTGT TAAAT GTGAAAAACCC TCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACT GCT GGCT T TAT T GAACAGCTAATAAAGAT T TATT TATT GTAATACCTCACAAACGT TGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGACTTAATATATT GAAGTAACACT TTTTTAGTAAGCAAGATACCT TTT TATTTCAATTCAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CT TGCAAAAGGGGAAGAAAGGAAT TGCGAAT ACATGTAAAAT GTCACCAGACAT T T GTAT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
AC AAAAT GAC T T AAAC CAT T CAT AT CAT GTTTCCTTTGCGTT C AGC C AAT T T C AAT T AAAAT GAA 
C TAAAT T AAAAA 
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FIGURE ia 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVT FDLLF 
VTLLWI IELNVNGGIENTLEKEVMQYDYYSSYFDI FLLAVFRFKVL I LAYAVCRLRHWWAI ALTT 
AVTSAFLIAKVILSKLFSQGAFGWLPIISFIIAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AAL I PGGLSDGQFYS PPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 



Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 



:; -,.-v, 
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FIGURE is 

ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCC TGGGGGCCAGTGAAGGAAGC TCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGC T GT GT GAC T C TAGT CT T GGCCCCAGGAAGCGAAGGAACAAAAGAAACT GGAAAGGAAGATGC 
TT TAGGAACATGT TT TGCTT TTTTAAAATATATATAT T TATAAGAGATCCT TTCCCAT TTAT TCT 
GGGAAGAT GT TTT TCAAACTCAGAGACAAGGACTT TGGTTT TTGTAAGACAAACGATGATATGAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 

MLDAGS FAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHE I TNETFRGLRRLERLY 

LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 

ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 

AGIAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPVVREPTALSSSLAPTWLSPTAPATEAP 

SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 

VTPRPPRSLTLGIEPYSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 

TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 

LLAAIAAVGAAYCWRGRAMAAAAQDKGQ 

CEVPLMG FPGPGLQS PLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCT TAGATTCCAAGACTACTT TGACATCAGATGAGTCAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATAT TTCT TGATTCAGAAGAATCTGA 
AT TAGAAT CCT CTAT TCAAGAAGAGGAAGACAGCC TCAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGATATCAGCTT TC TAGAGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCC T GGAGAGAGTGTCATATGC TCT T T TAT TT GGT GATTAC TT GCCACAGAATATCCAG 
GCAGCGAGAGAGATGT T T GAGAAGC TGAC T GAGGAAGGCT CT CCCAAGGGACAGAC TGCT CT T GG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT TAG TGGAAGGCT 
AATAATATTAACATCAGAAGAATT TGTGGTTTATAGCGGCCACAACTTTT TCAGCT TTCATGATC 
CAGATT T GCT TGTATTAAGACCAAATAT TCAGTTGAAC T TCCTTCAAAT T CT TGTTAAT GGATAT 
AACACAT GGAATCTACAT GTAAATGAAAGT T GGT GGAGT CCACAAT T T T TCT T TAAAAT GAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT TGTCAGAAT CAT TT TT TACAT TAGAT T ATCATAAT T T TAAAAATT TTTCTT TAGT T T TT CA 
AAAT T T T GT AAATGGT GGCTATAGAAAAACAACAT GAAATAT TATACAATAT T TTGCAACAAT GC 
CCTAAGAAT TGT TAAAAT TCATGGAGT TATT TGTGCAGAATGACTCCAGAGAGCTCTACTT TCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCC TGCT TCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTT TCT TTGCTAATTTGG 
AAGATTAACTCAT T TT TAATAAAAT TAT GT C TAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
E YDECTSDGREDGRLWCAT T YDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKI LNGSNKKS QKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALV Y YT FGAL GGNL I AHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites . 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 

AAT TCAGAT TTTAAGCCCAT TCTGCAGTGGAAT TTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTATTATACAAGAAAGGAGTGTACCTAT CACACACAGGGGGAAAAATGCTCTT T TGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGAT T TCATGTAATCGCTGCCT GT CT GACT GAATCAGGAT CAACAGC T T TAAAGGCAGA 
AACC TCAGAGAGAC T TCGTAC TGTGC T T CTGGATGT GACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGC TGGC TCCCACTGACTGGC TGACAC TAGAGGACTACAGAGAACC TAT TGAAGTGAA 
CCTGTTT G G AC T C A T C AG T G T GACAC T AAAT AT GCTTCCTTT GG T C AAG AAAGC T CAAG G GAG AG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
T AT GC AG T G GAAGG T T T C AAT GACAGC T T AAGAC GG GACAT GAAAGC TTTTGGTGT GC AC G T CT C 
ATGCAT T GAACCAGGAT T GT TCAAAACAAAC TTGGCAGAT CCAGTAAAGGTAAT TGAAAAAAAAC 
TCGCCAT TT GGGAGCAGC TGT C TCCAGACAT CAAACAACAAT ATGGAGAAGGT TACATT GAAAAA 
AGT C TAGACAAAC TGAAAGGCAATAAAT CCTATGTGAACATGGACC TCT C TCCGGT GGTAGAGTG 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGCTGGCTAATCCCAAGGCAGTG TGA CTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENVKRTAQW^ 

REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRIAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKIAIW 

LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAEIANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCAT TAC T GAT T CCCAAATGGAT GAT GTT GAAGT T GT T TATACAATT GA 
CAT TCAGAAATATATTCCAT GCTATCAGCT T T T TAGCT T T TATAAT TCT TCAGGCGAAGT AAAT G 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGTCAT T CAGATCAGAT CAT GACGT T TAGAGAGAGGCT GCT T CACAAAAACT TGCAGGAGCAT T T 
TTCAAACCAAGACC TT GT TT T TCT GC TAT T AACAC CAAGT AT AATAACAGAAAGC T GCT C TAC T C 
AT CGAC T GGAACAT TCCT TATATAAACCTCAAAAAGGACTT T T TCACAGGGT ACC T T TAGT GGT T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TT TTAGCCGAGCAGTACAAACACACAGCTCTAAATT TTTTGAAGAAGATGGATCCTTAAAGGAGG 
TACATAAGATAAATGAAATGTATGCT T CAT TACAAGAGGAAT TAAAGAGTATATGCAAAAAAGTG 
GAAGACAGTGAACAAGCAGTAGATAAAC TAGTAAAGGAT GTAAACAGAT TAAAACGAGAAATT GA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
AT GTCT T TAAAAAATAGACATGT T TCTAAAAGTAGC T GTAACTACAACCACCATCTCGAT GTAGT 
AGACAAT CT GACC T TAAT GGTAGAACACACTGACATT CC TGAAGC TAGT CCAGC TAGTACACCAC 
AAATCATTAAGCATAAAGCCT TAGACTTAGATGACAGATGGCAAT TCAAGAGATCTCGGTTGT TA 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGC CCAGAAACAGATGAAGAAAT T GAAAAGATGAAGGGT TT TGGT GAATAT TCACGGTCTC 
CTACATT T TGA TCC TT TTAACC T TACAAGGAGAT T TT T T TAT T TGGC T GAT GGGTAAAGCCAAAC 
ATT TCTATT GT T T T TAC TAT G T TGAGC TACT T GCAGTAAGT T CAT T TGT T T T TAC TATGT TCACC 
T G T T T G CAGT AAT ACACAGAT AAC TCT TAG T GCAT T TAC T T CACAAAG TAC T T T T TCAAACAT C A 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACAT T CT T TAGAAT T GGAAAAGTGAGAC CAGGCACAGTGGCTCACACCT G TAATCCCAGCAC T 
TAGGGAAGACAAGT CAGGAGGAT T GAT T GAAGCTAGGAGT TAGAGACCAGCCTGGGCAACGTATT 
GAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTATTTTCAAAATATGGAAA 
GAAAT T TATATGAAAAT TTATCT GAGT CAT TAAAAT TCT CC T TAAGTGATAC T T T T TTAGAAGTA 
CATTAT GGCTAGAGT T GCCAGATAAAATGC T GGATAT CATGCAATAAAT T T GCAAAACATCAT CT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 



APP_ID=10063567 



Page 173 of 320 



FIGURE 22 



MEGESTSAVLSGFVXGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PC YQLFS FYNS SGE VNE QALKKI LSNVKKNWGWYKFRRHSDQ IMT FRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE I EKMKG FGEYS RSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 

GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
IX AACAAGAACGATGCC CCCCT CCAC GAGAT CAACGGT GAT CATTTGAAGATCT GT CCCCAGG GTT CTACCT GCT GCTCT 

V% CAAGAGATGGAGGAGAAGTACAGC CT GCAAAGTAAAGAT GATTTCAAAAGT GTGGTCAGCGAACAGT GCAAT CAT TT G 

~2.fi?; 

Q C^AGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
|fl AATGATAT GTTT GT GAAGACATAT GG C CATTTATACATGCAAAATTCTGAGCTAT TTAAAGATCT CTTC GTAGAGTT G 

||| AAACGTTACTAC GT GGT GGGAAAT GT GAAC CT GGAAGAAAT GCTAAATGACT T CT GGGCTCGCCT CCT GGAGC GGAT G 

Iff TTCCGCCTGGTGAACTCCCAGTACC^CTTTACAGATGAGTATCT^ 

£|1 CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
S"$ TTAGCGGTT GCGGGAGATGT CGTGAGCAAGGTCT CCGTGGTAAACC CCACAGCC CAGTGTACCCAT GCC CT GTT GAAG 

3 ATGATCTACT GCT CC CACT GCCGGGGT CT CGTGACT GTGAAGCGAT GTTACAACTACTGCT CAAACAT CAT GAGAGGC 

Q TGTTTGGCCAACCAAGGGGAT CTCGATTTT GAAT GGAACAATTT CATAGAT GCTATG CT GATGGT GGCAGAGAGGCTA 

lf| GAGGGT CCTTT CAACATTGAAT CGGT CAT GGATC CCATCGAT GT GAAGATT T CT GAT GCTATTAT GAACAT GCAGGAT 

Q AATAGT GTT CAAGT GT CTCAGAAGGT TTT CCAGGGAT GTGGACCCC CCAAGCCCCT CC CAGCT GGAC GAATTTCT CGT 

HI TCCAT CTCT GAAAGTGC CTT CAGT G CT CGCTT CAGAC CACAT CACCCCGAGGAACGC CCAAC CACAGCAGCT GGCACT 

Q AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
flf GT TT GCAACGAT GAGAGGAT GGCT G CAGGAAACGGCAAT GAGGAT GACT GT T GGAATGGGAAAGGCAAAAGCAGGTAC 

CT GT TT GCAGT GACAGGAAATGGATTAGC CAACCAGGGCAACAACC CAGAGGTCCAGGT T GACAC CAGCAAACCAGAC 
ATACT GATCC TT CGTCAAATCATGGCT CTT CGAGT GATGACCAGCAAGAT GAAGAATGCATACAAT GGGAAC GACGT G 
GACTT CT TT GATAT CAGTGATGAAAGTAGT GGAGAAG GAAGT GGAAGTGGCT GT GAGTATCAGCAGT GCC CTTCAGAG 
TT TGACTACAAT GC CACTGACCAT G CT GGGAAGAGT GCCAAT GAGAAAGC C GACAGTGCTGGT GTCC GT CCT GGGGCA 
CAGGC CTACCT CCT CACTGTCTTCT GCAT CTT GT T CCTGGTTAT GCAGAGAGAGT GGAGATAATT CTCAAACTCT GAG 
AAAAAGTGTTCATCAAAAAGTTAAAAGGCACC^GTTATCACTTTTCTACCATCCTAGTGACTTTGCTTTTTAAATGAA 
T GGACAACAATGTACAGTTTTTACTAT GT GGC CACTG GT T TAAGAAGTGCT GACTTTGTTT TCT CATT CAGTTTTGG G 
AGGAAAAGGGACT GTGCAT TGAGTT GGTT CCT GC T CC CCCAAACCAT GT TAAAC GT GGCTAACAGT GTAGGTACAGAA 
CTATAGTTAGT T GT GCATT T GT GATT TTAT CACT CTATTATTTGT TT GTAT GTTT T TTTCT CATT T CGTTTGTGGGTT 
TT TTTTTCCAACT GTGATCT CGCCTT GTTT CTTACAAGCAAACCAGGGT C CCTT CT TGGCACGT AACAT GTACGTATT 
TCTGAAATAT TAAATAG CT GTACAGAAGCAGGTT TTATT T AT CAT GT TAT CTTAT TAAAAGAAAAAGCC CAAAAAGC 
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FIGURE 2 J. 

MARFGLPALLCTIAVLSAALLAAELKSKSCSETORLWSKGFNKNDAPLHEINGDHLKICPQGS^ 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 

LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 

QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 

VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDW 

MQDNSVQVSQKVFQGCGPPKPLPAGRI SRS I SESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 

EKLKQAKKFWSSLPSWCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGIANQGNNPEVQVDTS 

KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 

KSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



C TCGCCC TCAAATGGGAACGC TGGCCTGGGAC TAAAGCATAGACCACCAGGC TGAGTATCC TGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCT CCAGCAGGGAACCT TCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGC T T CTAGGAGAT GGCT CCAGGAAGGCGGCCAAGAATGT GAGTGCAAAGAT TGGTT CC T GAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
IT % AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
01 TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
tl| TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
tf'i GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
?'! AGCTAGTGTCATTTAACCTTAAATGCAATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTTTT 
AAATGTCAAAAAAAAAAAAAAAAAA 



fii 
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FIGURE 26 



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCT GAC T CCCAAGAGCAACCGCAAGAT GGAGAGCAAGAAACGGGAGCT C T TCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TC T TT TT CT GGGAGCATGTGGCAGAACCATATGGAAGCT GGGCC T T CATGTGGCAGCAAGT TT T C 
GAGCCCACCT GGAAACACAT TGGGGATGGCT GC TGCCTCACCAGAGAGACCT GGAAGGATC TTGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TT CCCCAGCCTCCAAT TAGAACAAGCCACCCACCAGCCTAT CTATC T T CCACT GAGAGGGACCTA 
GCAGAATGAGAGAAGACAT TCAT GTACCACCTAC TAGT CCCT C TC T CCCCAACCTCTGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACTCAAAGAGGCAGAGGT TTTGTTCTCAAATATT TT TTAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHimLLGCWQPLCKSYFPYIMAVLTPKSNRKMESKKRELFSQIKGL 

TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 

RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMW 

KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 

QLEQATHQPIYLPLRGT 

Important features; 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGT TTATGATTCTTATACTAAT TTATACAAAGATATTAAGGCCCTGT TCATTAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAZy^AAAAAAAAA 



m 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 
amino acids 1-15 



H= Growth factor and cytokines receptors family: 

^ amino acids 3-18 

C 

ill 
III 
m 
m 
%j 

o 
m 
a 

ill 
13 
fll 
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FIGURE 31 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
C TGCT CGT C TTGATTT T T GT TCTCAGAAAGAGAATAAAATT GACAGT TGAGCT TT T CCAAAT CAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT TCCGAGAATCATTGTCATGTACAT GCAAAACGCACT GAAAGAACAGCAGCATGGTGCATT GT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATGCATATACTACAACTGCTATTAATGGGACAGATTTCTGTACATCAGCAAAAGATGC 
AT TC AAAATC TT GTCCAAGAAC T CAAGTCACT T TACAT CTAT TAAC TGCT TTGGAGACT T CATAA 
TT T T TC TAGGAAAGGT GT TAGTGGT GT G T T T CAC T GTT T T T GGAGGAC T CATGGCT T TTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGAT CGTCAGAAAAGCCC TACT TT AT GGATCAAGAATT TC TGAGT T T CG TA 
AAAAGGAGCAACAAAT TAAACAAT GCAAGGGCACAGCAGGACAAGCACTCAT TAAGGAAT GAGGA 
GGGAACAGAACTC CAGGCCAT TGT GAGATAGATACCCATT TAGGTATC TGTACC TGGAAAACAT T 
TCC T TCTAAGAGCCAT T T ACAGAATAGAAGAT GAGACCACTAGAGAAAAGT TAGTGAAT T T T TTT 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSIAMMFTF^ 

LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFEWLWAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLC FA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features : 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE aa 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 

ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 

GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 

TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 

AT TCAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCCCAAGT TCAC T GT GCC T CAGATCAACT GC 

GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 

CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 

TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 

GGT TACAAAGGGAGT TAT T CCAACGGT GTCCAATCGT T ATCCCTACCACGAT GGAGAGAATCCT T 

TATCGT CTTAGAAAGTAAACCCAAAAAGGGTGTAACC TACCCATCAGCT CTTACATAC T CAT CAT 

CGAAAAGT CCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTAT CAGAGGCCACC TAT T CCAGGG 

ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 

CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 

GCCACAGGAGCCAGGAGAT GGATCTCT GGT CCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 

AGAGC T GATCCAGGTAT CCAAAGGCAAGAT CCT TCAGGAGC TGCC T TCCAGAAACC TGT T GGAGC 

GGATGT CAGCC TGGGACTT GT TCCAAAAGAAGAAT TGAGCACACAGTC T T TGGAGCCAGTAT CCC 

TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA. 

CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 

CGG T C C AC T GAT GGGT G T T G T CCAG TAT GGAGACAAC CC T G C TAC T CAC T T T AAC C T CAAGACAC 

ACACGAAT TCTCGAGAT CT GAAGACAGCCATAGAGAAAATTAC T CAGAGAGGAGGACTT TCTAAT 

GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 

GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

GACTTGCGAGAGAGTCAGGAATCAACATTT TCTTCATCACCAT TGAAGGTGCTGCTGAAAATGAG 

AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 

GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 

ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 

AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 

T GGAGT T TGGGT TCGACAAGTACAGCAGCAAGCC T GACATCCT CAACGCCAT CAAGAGGGTGGGC 

TAC TGGAGT GGTGGCACCAGCACGGGGGC TGCCAT CAACTT CGCCC TGGAGCAGCT CT T CAAGAA 

GT CCAAGCCCAACAAGAGGAAGTTAAT GATCC TCATCACCGACGGGAGGT CC TACGACGACGT CC 

GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 

GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 

GTT TGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATT TGTACAGAGT TCAACTCAC 

AGCCT CGGAACTGAAT TCAGAGCAGGCAGAGCACCAGCAAGT GC TGCT T T ACTAAC TGACGTGT T 

GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 

AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 

TGATCACAAACGTATAGAAT GAGC CAAAAGGC TACATCAT GT TGAGGGT GCT GGAGATT T TACAT 

T T T GACAAT T GT T T TCAAAATAAATGT T CGGAAT ACAGT GCAGCCCT TACGACAGGC TTACGTAG 

AGC T T T T GT GAGAT T T T TAAGTTGT TAT T T CTGAT T TGAACT C TGTAACCCT CAGCAAG T T T CAT 

TT TTGTCATGACAATGTAGGAATTGCTGAAT TAAATGT TTAGAAGGATGAAAAATAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGTOSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYAS YS SVCGAAVHSGVLDNSGGKI LVRKVAGQS GYKGS YSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTmQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLE PVS LGDPNCKI DLS FLI DGS TS IGKRRFRI QKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNWVWVDGWPTDKVTCEASRLARES 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKE FE I S DTDTRI GAVQYT YEQRLE FGFDKYS SKPDI LNAI KRVGYWS GGT S TGAAINFALEQL 
FKKSKPNKRKLMI L I T DGRS YDDVR I PAMAAHLKGVI T YAI GVAWAAQEE LE VI ATHPARDHSFF 
VDEFDNLHQYVPRIIQNICTEFNSQPRN 

Important features : 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Ami da ti on site. 

amino acids 304-308 
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FIGURE 35 



CCGAGC AC AGGAGATT GCCT GCGTTT AGGAGGTGGCTGCGT TGTGGGAAAAGCT AT CAAGGAAGAAAT T GC 
CAAACC AT GTCTTTTTTTCT GT TTTC AGAGT AGT TC AC AACAGATCTGAGTGT TTT AAT T AAGC AT GGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGAT6GCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CT CAGCCT TCCCCACT AC AATGT GAT AGAACGCGT GAACTGGATGT ACTT CT ATGAGT ATGAGCCGAT T T A 
CAGACAAGACTT TC ACT T CACACT TCGAGAGCAT TC AAAC TGCT CTCAT CAAAATCCATT TC T GGT C AT TC 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
TGGT GGGGAT AT GAGGTT CT TACATTTT TCTT AT T AGGCC AAGAGGCTGAAAAGGAAG ACAAAATGTTGGC 
ATTGTCCT T AGAGGAT GAAC ACCTT CTT T AT GGT GACATAATCCGACAAGAT T TTTT AGACACATAT AAT A 
ACCTGACCT TGAAAAC C ATT AT GGCATT CAGGTGGGT AACT G AGTTTT GCCCCAAT GCCAAGT ACGT AAT G 
AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA 
GAAGTT TT T C AC AG GTT ATCCT CT AAT T GAT AAT TATT CCT AT AGAGGAT TT TACCAAAAAACCCAT ATT T 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
TTGGTGCC AAGGAT CT AT GAAATGATGGGT CACGTAAAAC CC AT C AAGTT TGAAGATGTTT ATGT CGGGAT 
CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 
AT T TGG AT GTCT GTCAAC TGAGAC GT GT GATT GC AGCCCATGGCTT T TCTTCCAAGGAGATC AT CACT TTT 
TGGC AGGTCATGCT AAGGAACACC ACAT GC CATT AT TAACT TCACAT TCTAC AAAAAGCCTAGAAGGACAG 
GATACC TT GTGGAAAGT GTT AAAT AAAGT AGGT ACTGTGG AAAATT C ATGGGGAGGTC AGTGTGCTGGCT T 
ACACTGAACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTCAGG 
CCCT TC AAAGAT GAT ATGTGGAGGAATT AAAT AT AAAGGAATTGGAGGTT TTTGCT AAAGAAAT T AAT AGG 
ACCAAAC AATTT GGACAT GT CATT CTGT AG ACT AGAAT T T CT TAAAAGGGT GT T ACTGAGTT AT AAGCTC A 
CTAGGCTGTAAAAACAAAAC AATGT AGAGT TTT AT TT AT T GAACAATGT AGT CACT TG AAGGTTT TGT GT A 
TATCTT AT GTGGAT TACC AATTT AAAAAT AT AT GT AGT TC TGTGTCAAAAAACTTCTTCACT GAAGTT AT A 
CTGAAC AAAAT T TT ACCTGT T TTT GGTCAT T T AT AAAGT ACT TCAAGAT GT TGC AGT AT TTCACAGT TAT T 
ATT ATTTAAAAT TACTTC AACTTT GT GT T T TT AAAT GT TT TGACGATT TC AATACAAG ATAAAAAGGATAG 
TGAATC AT TCTT TACATGCAAACATT TT CCAGTT ACTT AACT GATCAGTTT AT T AT TGAT ACAT CACTCC A 
TTAATGT AAAGT C ATAGGTC ATT ATT GC AT AT C AGT AATCT C TTGGACTTTGT T AAAT ATT T TACT GT GGT 
AATATAGAGAAG AATTAAAGCAAGAAAATC TG AAAA 
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FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLLLSL 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP IKFEDVYVGICLNLLKVNIHI PEDTNLFFLYRIHLDVCQLRRVI AAHGFSSKE 1 1 TFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites* 

amino acids 72-76, 154-158, 198-202/ 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
T CGT C CTT GCCAAGAGAGTACACAGTCAT TAATGAAGCC TGCCC TGGAGCAGAGT GGAATATCATGTGTCGGGAGT GCTGT G 
AATATGATCAGATTGAGTGC GTC TGCCCCGGAAAGAGGGAAGTCGTGGGTTATAC GATCCCT TGCTGCAGGAATGAGGAGAA 
TGAGTGTGACT CC TGC C TGATCCACCC AGGT TGTAC CATCT TTGAAAAC TGCAAGAGC TGCCGAAATGGCT CATGGGGGGGT 
ACC TTGGATGACT TC TATGT GAAGGGGT TCTACTGT GCAGAGTGCCGAGCAGGCT GGTAC GGAGGAGAC TGCATGC GATGT G 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTC^CTGTGAATGGACCATTCATGC 
TAAACCTGGGT TTGT CATCCAAC TAAGATTT GTCATGTT GAGTC TGGAGTTT GAC TACAT GTGCCAGTATGAC TAT GT TGAG 
GTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAG 
GATCCTCAC TCCACGTC CTC T TCCACTCCGATGGCTCCAAGAAT T T TGACGGT TTCCAT GCCAT T TATGAGGAGAT CACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATACTGGGCAGCG^ TGTGAAAATC TCCT TGAAGAAAGAAACTGCTCAGACCC TGGGGGCCCAGT CAATGGGTACCAGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
T CT TAGTGGCAAT GAGAAAAGAAC TTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATAAAAGCC TGCCGA 
GAACCAAAGAT TTCAGACCTGGTGAGAAGGAGAGT TC TTCCGATGCAGGTT CAGTCAAGGGAGACACCATTACACCAGCTAT 



" ATACC AACATCTGCATACCCAGC TCCAGTATGAGT GCATCTCACCCTTC TACCGC CGCCTGGGCAGC AGCAGGAGGACATGT 

x: C TGAGGACTGGGAAGTGGAGT GGGCGGGCACCATC C T GCAT CCC TATC TGCGGGAAAAT TGAGAACATCAC TGC TCCAAAGA 

* CCCAAGGGT TGCGCTGGCCGTGGCAGGCAGCC ATC TACAGGAGGACCAGCGGGGT GC ATGACGGCAGCC TACACAAGGGAGC 

i'J GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
: s ? GTCACCATGAT CAAGACAGCAGACCTGAAAGT TGTT T TGGGGAAATTC TACCGGGATGATGACCGGGATGAGAAGAC CATCC 

%J AGAGCC TACAGAT TT C TGC TATCATTC TGCATCCCAACTAT GACCCCATCC TGCT TGATGCTGACATCGCCATCCTGAAGC T 

III C CTAGACAAGGCCCGTATC AGCACCCGAGTCCAGCCC ATCTGCC TCGC TGCCAGT CGGGATCT CAGCAC TTCCTTCCAGGAG 

Q TCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG 
ffj TGGTCAGTGTGGT GGACTCGCTGC TGTGTGAGGAGCAGCAT GAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGT T 

C TGTGCCAGCT GGGAACCCAC TGCCCCT TCTGATATC TGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTC CCGGGA 
CGAGCATC TCC TGAGC CACGC TGGCATC TGAT GGGAC TGGT CAGCTGGAGC TATGAT AAAACATGCAGCCACAGGC TCTCCA 
C TGCC TTCACCAAGGT GC TGC CTTTTAAAGAC TGGATTGAAAGAAATATGAA AT GAA CCATGC TCATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTTCTGACTTC^GGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
C TAC TAGGACAGCCAATTGGAAGATGCCAGGGC TTGCAAGAAGT AAGT T TC TT CAAAGAAGACCATATACAAAACC TCT CCA 
C TCC ACTGACC T GGTGGTCT T CCCCAACT TTCAGT TATACGAATGCCATCAGC TTGACCAGGGAAGAT C TGGGCTTCATGAG 
GCCCC TT TTGAGGCT C TCAAGTTCTAGAGAGC TGC C T GTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCT T 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 38 



MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
I PCCRNEENECDSCL IHPGCT I FENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAP IQS I GS SLHVLFHSDGSKNFDGFHAI YE E I TACS S SPC FHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKI TGGPGL I NGRHAKI GTWS FFCNNS YVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALP FGDLPMGYQHLHTQLQYEC I SP FYRRLGS SRRTCLRTGKWSGRAPS CI PI CGKI ENI TAP 
KTQGLRWPWQAAI YRRT S GVHDGS LHKGAWFLVCSGALVNERTVWAAHCVT DLGKVTMIKT ADL 
KWLGKFYRDDDRDEKT IQSLQI SAI I LHPNYDP I LLDADIAI LKLLDKARIS TRVQP I CLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSVVDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDI C TAETGGI AAVS FPGRAS PEPRWHIjMGLVSWS YDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 
amino acids 1-23 
I EGF-like domain cysteine pattern signature, 

amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site, 
amino acids 56-60 
Serine proteases, trypsin family, 
amino acids 489-506 
CUB domain proteins profile, 
amino acids 150-167 
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FIGURE so 



GGTTCCTACATCCT CTCAT CTGAGAAT CAGAGAGCATAAT CTT CT TAC GGGCCCGTGATTTATTAAC GTGGCTTAAT C 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTT GGGCCCTTGTAGCTGACAGAAGGT GG CCAGGGAGAATGCAGCACACT GCTC GGAG AATGA AGGCGCTTCT GT TGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GCCTCCCACTACGGC CTGACCAAAGATAGGAAGAGGC GCT CACAAGATGGCTGT CCAGACG GCT GTGCGAGC C 
TCACAGCCACGGCT C CCTCC CCAGAGGT TT CT GCAGCTGCCACCATCT CCTTAATGACAGACGAGCCT GGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCACGGCCCTTT GAGAGAT CCACTATTAGAAGCAGATCAT TTAAAAAAATAAAT CGAGCTTT GAGTGTTCTTC GAA 
GGACAAAGAGCG GGAGT GCAGT TGCCAACCAT GC CGACCAGGGC^GGGAAAATT CT GAAAACACCACT GCCCCTGAAG 
T CTTT CCAAGGTTGTAC CACCT GATT CCAGATGGT GAAATTACCAGCAT CAAGAT CAAT C GAGTAGATCCCAGT GAAA 
GCCTCT CTATTAGGCTGGT GGGAGGTAGC GAAACCCCACT GGTCCATAT CATTATCCAACACATT TATCGTGATGGGG 
T GAT CGCCAGAGACGGC CGGCTACTGCCAGGAGACAT CATT CTAAAGGT CAACGGGATGGACATCAGCAATGTCCCT C 
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
GCAGGAACAATGGACAGGCCCCGGAT GCCTACAGAC CCCGAGAT GACAGCT TTCATGTGATT CTCAACAAAAGTAGC C 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GT GTGGCATAT C GACAT GGT CAGCTT GAGGAGAAT GACC GTGTGTTAGCCAT CAAT GGACAT GATCTTC GATAT GGCA 
GCCCAGAAAGTGCGGCT CATCT GATT CAGGC CAGT GAAAGACGT GTTCACCT CGTCGTGTCC CGCCAGGTT CGGCAGC 
GGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGCAATGGC!AGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACA 
CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAA 
GCAT GACCGT CGCAGGG GGAGCAT CACATAGAGAAT GGGATTT GC CTAT CTATGT CATCAGTGTT GAGCCC GGAG GAG 
T CATAAGCAGAGAT GGAAGAATAAAAACAGGTGACAT TTT GT TGAAT GTGGATGGGGTCGAACTGACAGAGGTCAGC C 
GGAGT GAGGCAGTGGCATTATTGAAAAGAACATCAT C CTCGATAGTACT CAAAGCTTTGGAAGT CAAAGAGTAT GAGC 
C CCAGGAAGACT GCAGCAG CCCAGCAGCCCT GGACT CCAACCACAACAT GGCCCCACCCAGT GACT GGT C CCCAT CCT 
GGGT CATGTGGCT GGAATTACCACGGT GCTT GTATAACT GTAAAGATATT GTATTACGAAGAAACACAGCT GGAAGT C 
T GGG CTT CT GCATT GTAGGAGGTTAT GAAGAATACAATGGAAAGAAACCTTTTT TCATCAAATC CATTGTTGAAGGAA 
CACCAGCATACAATGAT GGAAGAATTAGATGT GGTGATATTCTT CTTGCT GT CAAT GGTAGAAGTACAT CAGGAAT GA 
TACAT GCTT GCTT GGCAAGACT GCTGAAAGAACTTAAAGGAAGAATTAC TCTAACTATTGT TTCTTGGCCT GGCACT T 
TTTTATAGAATCAATGAT GGGT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACTATATTTAT C 
TTGT CAGTTTT TATATTTAAAGAAAGAATACATT GTAAAAAT GT CAGGAAAAGTAT GAT CATCTAAT GAAAGCCAGTT 
ACACCT CAGAAAATATGATTCCAAAAAAATTAAAACTACTAGTTTTT TT T CAGT GT GGAGGATTT CTCATTACT CTAC 
AACATT GTTTATATTTTTTCTATTCAATAAAAAG CC CTAAAACAACTAAAAT GATT GATTTGTATACCCCACTGAATT 
CAAGCT GATTTAAATTTAAAATTT GGTATAT GCT GAAGT CT GC CAAGGGTACATTATGGCCAT T TTTAATTTACAGCT 
AAAATATTTTTTAAAAT GCAT T GCTGAGAAAC GTTGCTTTCAT CAAACAAGAATAAATATTTTT CAGAAGT T AAA 
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FIGURE AO 



MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHI I IQHIYRDGVIARDGRLLPGDI ILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 

SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RI RCGD I LLAVNGRS TS GMI HACLARLLKELKGRI TLT I VS WPGT FL 



N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 




Important features: 
Signal peptide: 

amino acids 1-15 
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FIGURE 41 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAA CATG GGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCC TCAT TT TGGGGAAGGGAAAAACT CTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAAC T GT CCT T CT G T GT C T CCTTACC T CAGAGGC CAGAGCAAGCT CATT T T CAAAC CAGATC T CAC 
T TT GGAAGAGGTACAGGCAGAAAAT CCCAAAGT GT CCAGAGGCCGGTAT CGCCCTCAGGAATGTAAAG 
C TT T ACAGAGGG T CGC CAT C CT C GT T C C CCAC C GGAACAGAGAGAAACACCT GAT GT AC CT GCT GGAA 
CATCTGCAT CCCT TCCTGCAGAGGCAGCAGCT GGAT TAT GGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATC CCAAGCAT C TGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTAC GTTACAGTGGATATT TTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGG GAGGCGAAGACGATGAC C T CAGACT C AGG GTT GAGC T CCAAAGAAT GAAAAT TTCCCGGCCCCTG 
CCT GAAGTGGGTAAATATACAATGGTCTT CCACACTAGAGACAAAGGCAAT GAGGTGAACGCAGAACG 
GAT GAAGCTCTTACACCAAGT GT CACGAGTCTGGAGAACAGATGGGT TGAGTAGTTGTTCT TATAAAT 
TAGTATCTGTGGAACACAATCCT TTATATATCAACATCACAGTGGAT T TCT GGTT TGGTGCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
T AGC AGAGC T CCT GG T GAT GTAGAG TATAAAACAG T T GTAACAAGACAG CT T T CT TAGTCATT T T GAT 
CATGAGGGTTAAATAT TGTAATATGGATACTT GAAGGACT TTATATAAAAGGATGACTCAAAGGATAA 
AAT GAACGC TAT TTGAGGACT CTGGTTGAAGGAGATTTAT TTAAATT TGAAGTAATATATTATGGGAT 
AAAAG GC C ACAG GAAATAAGACT GCT GAATGT CTGAGAGAACCAGAG TTGTTCTCGTC CAAGGTAGAA 
AGGT AC GAAGAT ACAAT AC TG T TAT T CAT TTATC C T GTACAAT CATC T G TGAAGT GGTGGT GTC AGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TTTAAAGAGTT TTT GTAAAATGAT TTTGTACAAGTAGGATAT GAAT TAGCAGTTTACAAGTTT ACAT 
ATTAACTAATAATAAATATGT CTAT CAAATACCTCT GTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 42 

MGWLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHIMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINI TVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE A* 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAA!\AAAAAAA?\AAAAAAAAAAAAAA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE ±± 

MALS S Q I WAACLLLLLLLASLT SGS VFPQQTGQLAELQPQDRAGAEIASWMPMFQRRRRRDTHFP I 
CIFCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

CAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE ax 

GTGGCT TCAT T TCAGTGGC TGAC T TCCAGAGAGCAATATCGC TGGT TCCCCAACATGCCT CACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAAT GAC T CAGGGAT CTACTATGTGGGGATATACAGC T CATCAC T CCAGCAGCCC TCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACC TGT GTGACCAATC T GACATGC TGCATGGAACAT GGGGAAGAGGAT GT GAT T 

TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
C TGGAGATGGGGAGAAAGTGATATGACCT TCATCT GCGTTGCCAGGAACCCT GT CAGCAGAAAC T 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GT T TCT GAAGAGAGAGAGACAAGAAGAGT ACAT TGAAGAGAAGAAGAGAGTGGACAT TT GTCGGG 
AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
AGAACAATCC TAAAGGAAGATCCAGCAAATACGGT T TAC TCCACTGTGGAAATACCGAAAAAGAT 

GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TC TAGACAGCAGT GCAC T CCCC TAAGTCTCT GC TCA 
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FIGURE a6 



m 

m 



MAGSPTCLTLIYILWQLTGSAA.SGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLYTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGS I LPI SWRWGESDMTFI C 
VARNPVSRNFS S P I LARKLCEGAADDPDS SMVLLCLLLVPLLLS L FVLGLFLWFLKRERQEE YI E 

EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 4Z 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGAT GGCCAT TCCAGCAACAACAATGTCC T TGACAGCAAGAAAAAGAGCGT G 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT TGTGAATT TTCAT T GAAAAACATCAGT GACAT TCAT CCAGAATCCT TCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCAT GGCGAGTGGC T GGAGAGCATC TAGT T TCCAC T T CGAT TC TGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TGTAGT TTAATGGGAATAAAATGTAAGTATCAGTAGTTTG A A A AAA A A A A A 
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FIGURE a8 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE aq 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CAT TTGCCAATAAAGACGAT CCC T T CTACTATGAC T GGAAAAACCTGCAGCTGAGCGGACT GATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCC T GTACCT GAGAAGGCCATCCCACTCATCAC TCCAGGCTCT GCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 



APP_ID=1 0063567 



Page 201 of 320 



FIGURE so 



mervtiallllagltaleandpf™^ 
s qkqhs pvpekai pl i t pgs at tc 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 




N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7. 

amino acids 54-67 
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FIGURE si 



GTGGACT C TGAGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGC TGCAGACC CAGAGGGAGGG 

AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 

AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 

CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 

GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 

GCCC TTGGCCAAGGGACCAGAGAAGCAGT TGGCAC TGGAGTCAGGCAGGT TCCAGGCT TT GGCGC 

AGCAGAT GCTT TGGGCAACAGGGTCGGGGAAGCAGCCCAT GC TC TGGGAAACAC TGGGCACGAGA 

TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 

GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 

ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

CTTCAGAGGACAGGGAGT T T CCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCC T CCT TG 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 

GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 

GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
^^^A^AAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE S2 



m 



MKFQGPIACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKV-S 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHG^AVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSS SGGS S GGS SGGS SGNS GGS RGDS GSE S S WGS S TGSS SGNHGGS GGGNGHKPGCEKPGNE 

ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 



13 Signal peptide: 

%M amino acids 1-21 

w 

IP 



N-glycosylation site. 

amino acids 2 65-2 69 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 

CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 

ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAAC7\ACTGCCGCCGGCTCCAGTGTTTCC 

CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 

TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 

CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 

TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 

AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 

GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 

CCT CAGAGGGCAGCAGTCGTC TGGACAT GT T T GAGCACAT CAGCCT CAT GACC T TGGACAGTCTA 

CAGAAATGCATCTTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT 

C TTGGAGCTCAGTGCCCT TGTAGAGAAAAGAAGCCAGCATAT CCTCCAGCACATGGACT T T CTGT 

ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 

GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 

AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 

CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 

GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 

ACAGGAGGT GCAAGAGCT TC TGAAGGACCGCGATCCTAAAGAGAT TGAATGGGACGACCT GGCCC 

AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 

TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 

CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 

CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 

GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 

GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 

GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 

CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 



fr:; 



111 



MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARIL 

LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVIjYNIARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
LAFI PFSAGPRNC I GQAFAMAEMKVVLALMLLHFRFLPDHTE PRRKLEL IMRAEGGL WLRVEPLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 



11 amino acids 461-471 

Ill 

W N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE ss 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAGCTGGTGGACAGTTTT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
fa GTGATT CGAACAT T TGAGGGT TAC TTTT GGAAGCAACAATACAT TC TCGAACC TGAATGTCAGTA 

Cl GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
Cf TCTGGATGT TGTCCCACTGAATTCCCATGAATACAAACCTAT TCAGCAACAGCAAAAAAAAAAAA 

f \ ^^^^ 

-S 

C3 
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FIGURE 56 

MGPVKQLKRMFE PTRL I AT IMVLLC FALTLCS AFWWHNKGLAL I FC I LQSLALTWYS LS FI PFAR 
DAVKKC FAVCLA 



Important features: 
Signal peptide: 

amino acids 1-33 

i: 

Q Type II fibronectin collagen-binding domain protein. 

W amino acids 30-72 

m 
W 
m 
m 

S3 

m 
o 

• 

a 
ru 
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FIGURE *7 

CG GCT CGAGCTCGAGCCGAATCGG CTCGAGGGGCAGT GGAGCACC CAGCAGGCCGCCAACATGCT CT GTCT GT GCCT G 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTT TCAAGCTCAGT GT CT T CAT C C CCTC CCAGGAAT T CTC CACCTACCG CCAGT GGAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGACCTT GATGGGCAGCTAGACTTT GAAGAATTT GT CCATTATCT CCAAGATCATGAGAAGAAGCT GAGG 
CT GGT GTT TAAGATTTTGGACAAAAAGAAT GAT GGAC GCATT GACGCGCAGGAGATCAT GCAGTC CCT GC GGGAC TTG 
GGAGTCAAGATAT CTGAACAGCAGGCAGAAAAAATTCT CAAGAGCAT GGATAAAAACGGCAC GATGACCATCGACTGG 
AACGAGTGGAGAGACTACCACCTCCTC CACCC CGT GGAAAACATCCC C GAGAT CATCCT CTACT GGAAGCATT CCACG 
ATCTTTGATGTGGGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCATGCCTC CCGCAGCAACAACAT GGGCAT CGTT GGTGGCTT CACTCAGATGATT CGAGAAGGAGGGGC CAGG 
TCACT CT GGC GGGG CAAT GGCAT CAACGTCCT CAAAATT GCCC C CGAAT CAGC CATCAAATT CATGGCCTATGAGCAG 
AT CAAGCGCCTTGTTGGTAGTGAC CAGGAGACTCTGAGGATT CACGAGAGGCTT GTGGCAGGGT CCTT GGCAGGGGCC 
AT CGCCCAGAGCAGCATCTACCCAATGGAGGT CCT GAAGACCC GGAT GGC GCT GCGGAAGACAG GCCAGTACT CAGGA 
ATGCT G GACTGCGCCAGGAGGATCCTGGC CAGAGAGGGGGTGGCC GCCTT CTACAAAGGCTATGT CC CCAACAT GCTG 
GGCAT CATCCC CTAT GCCGGCAT CGAC CTT GCAGTCTACGAGACGCT CAAGAAT GCCT GGCT GCAGCACTATGCAGT G 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCATCAGCTACGT GGTCTAC GAGAACCT GAAGATCACCCTGGGC GTGCAGT C GCGGTGAC GGGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCT GTCTCGAGCCAAGCTGT GAAAACC CTAGAC GCAC CCGCAGGGAGGGT GGGGAGAGCT GGCAGGC C CAGGGCTT 
GT CCT GCTGACCCCAGCAGAC C CT CCTGTTGGTT CCAGC GAAGACCACAGGCAT TCCTTAGGGT CCAGGGTCAGCAGG 
CT CCGGGCT CACAT GT GTAAGGACAGGACATTTT CTGCAGT GCCT GCCAATAGT GAGCTT GGAGCCT G GAGGCC GGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAAT CCAT GATGAAAGGT GAGGTCAC GT GGCCTCCCAGGC CTGACT TCC CAAC CTACAGCATT GAC GC CAACTT GGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
ACTGT T GGGAAAAGGGT TTT GTCCAGAAGGACAAGC CGGACAAAT GAGC GACTTCT GTGCTT CCAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGAC CAGC CCCACATT CCACTTGT GT CACT GCTTGGAAC CTATTTATTTT GTATTTATTT GAACAGAGT TATGT CCT 
AACTAT T TT TATAGATTTGTTTAATTAATAGCTT GT CATTT TCAAGT TCATTTTT TATTCATAT TTATGTT CAT GGTT 
GATTGTACCTT CCCAAGCCCGC CCAGT GGGAT GGGAGGAGGAGGAGAAGG GGGG C CTTGGGC CGCT GCAGT CACAT CT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CTTCAATAGGAT GCAAAGAT CAAT GCAAAAAT T GTTATATAT GAACATATAACTGGAGTC GT CAAAAAG 
CAAATTAAGAAAGAAT T GGACGTTAGAAGTT GT CATT TAAAGCAG C CTTCTAATAAAGTTGTTTCAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *8 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNE WRDYHLLHPVENI PE I IL YWKHS T I FDVGENLTVPDE FTVEERQTGMWWRHLVAGGG 

AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
ILAREGVAAFYKGYVPNMLGI I PYAGIDIAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC 
GQLAS YPLALVRTRMQAQAS IEGAPEVTMSSLFKHI LRTEGAFGLYRGLAPNFMKVI PAVS I S YV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE xq 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CAC T CAT CAT TGGC T TT GGT AT TTCAGGGAGACAC T CCAT CACAGTCAC TACTGTCGCCTCAGC T 
GGGAACAT T GGGGAGGATGGAATCCTGAGCT GCAC T TT TGAACC TGACAT CAAACT T TCTGATAT 
CGTGATACAATGGCTGAAGGAAGGTGT T T TAGGCTT GGT CCATGAGT TCAAAGAAGGCAAAGATG 
AGCTGT CGGAGCAGGAT GAAATGTTCAGAGGCCGGACAGCAG TGT T TGCT GAT CAAGTGATAGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CAT CAC T T CTAAAGG C AAGG GGAAT GC T AAC C T T GAG T ATAAAAC T GGAGC C T T CAGC AT GC C GG 

AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATAC TCC T GTATGAT TGAAAATGACATT GCCAAAGCAACAGGGGATATCAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GTGCCT T GGCCACAAAAAAGCATGCAAAGTCAT TGT TACAACAGGGATCTACAGAAC TAT T T CAC 
CACCAGATAT GACC TAGTT T TATATT TC TGGGAGGAAATGAAT TCATAT CTAGAAGTCT GGAGTG 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
CTATCT T CAAAGACATAT TAGAAGT TGGGAAAATAAT T CATGTGAAC TAGACAAGTGTGT TAAGA 

GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGC TGTAGTAT GTACCC TAAGACGC TGC TAAT TGAC TGCCACT TCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TC T TCCCAACTGACAAATGCCAAAGT T GAGAAAAATGATCATAAT T TTAGCATAAACAGAGCAGT 
CGGGGACACCGATT TTATAAATAAACTGAGCACCT TCTTTT TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 

D I VI QWLKE GVLGL VHE FKE GKDEL SE QDEMFRGRTAVFADQVI VGNASLRLKNVQLTDAG T YKC 

YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 

NTSFELNSEWTMKWSVLYWTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI SWALLPLS PYLMLK 



Important features: 
Signal peptide: 

amino acids 1-28 



1=5 Transmembrane domain: 

fl% amino acids 258-281 

%| 

% N-glycosylation sites. 

J!! amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 

«4 220-224 

its 

O N-myristoylation sites, 

*w amino acids 52-58, 126-132, 188-194 
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FIGURE 6l 



T GACGT CAGAATCACCATGGCCAGC TAT CC TTACCGGCAGGGC TGCCCAGGAGC T GCAGGACAAG 

CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 

GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 

TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 

GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 

GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 

GGT GGACTCAGATCACAGT GGC TATAT CT CCAT GAAGGAGCT AAAGCAGGCCCTGGTCAACTGCA 

ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 

GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 

CCAGCAGTATGACCGGGACCGCTCGGGC T CCAT TAGC TACACAGAGC TGCAGCAAGCTCT GTCCC 

AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 

GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 

GGCCTT CCGGGAGAAGGACACAGC TGTACAAGGCAACATCCGGC TCAGCTTCGAGGACT T CGT CA 

CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 

TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 

AT TCTCCCTTGC T TGATGCAACAC TGTT CCAAAAGAGGGT GGAGAGTCCT GCATCATAGCCACCA 

AATAGT GAGGACCGGGGC TGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGT T GA 

ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 

AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 

GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 

CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 

CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 

CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 

TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 

AT T TGGGGCCAAAAGT CCAGTGAAAT T GTAAGCT TCAATAAAAGGATGAAAC T CTGA 



ID=10063567 



Page 213 



FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALVNCNWSSFNDETCLMMIl^ 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNI RL S FEDFVTMTAS RML 



Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Jg. N-glycosylation site. 

m 

*si ammo acids 147-150 

'Si 

% Casein kinase II phosphorylation sites. 

W amino acids 135-138, 150-153, 202-205, 271-274 

ill 



I'll 



N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6<* 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCT GGGGGAATATTCAT C TATACCCAGGTATCAATAACC CAT T TCC T CC TGGAGTTC TCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGC TCATCTCCAATAAAATAAAAGCACT TATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
^^^^ 
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FIGURE 64 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDIJ^VPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQtMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 



Important features of the protein: 
O Signal peptide: 

*J amino acids 1-26 

III 

IP Casein kinase II phosphorylation sites. 

ff! amino acids 56-59, 155-158 



N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6* 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACAC ATGA CAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 




'S. 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 



Important features: 
Signal peptide: 

amino acids 1-19 

M 

D N-glycosylation site. 

o 

::: amino acids 37-41 

III 

W 

Iff N-nryristoylation sites. 

Ill amino acids 15-21, 19-25, 60-66 

H 
s 

o 
in 
o 
ni 

13 

ru 
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FIGURE 67 

ACGGACC GAGGGT TCGAGGGAGGGACACGGACCAGGAACCTGAGC TAGGT CAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGT GCCC TCTC TGGGCAGTCAGAT CCAC CCAGTGC T TAATAGCAGGGAAGAAGGTAC T TCAA 
AGACTC TGCCCCT GAGGT CAAGAGAGGATGGGGC TAT TCACT T TTATATAT T TATATAAAAT TAG 
TAGTGAGAT GTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLLLALGLPFLIJ\RWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 

GCCAGGAATAACTAGAGAGGAACAATGGGGTTATT CAGAGGTTTT GTTTT CC T CTTAGTT CTGTGCCTGCTGCACCAG 
TCAAATACTT CCTTCATTAAGCTGAATAATAAT GGCTT TGAAGATATTGT CATT GTTATAGAT CCTAGTGTGCCAGAA 
GAT GAAAAAATAAT T GAACAAATAGAGGATATG GTGACTACAGCTTCTACGTACCTGTTTGAAGC CACAGAAAAAAGA 
TTTTTTTT CAAAAAT GTAT CTATATTAATT CCT GAGAATT GGAAGGAAAATCCTCAGTACAAAAGGCCAAAACAT GAA 
AACCATAAACAT GCT GATGTTATAGTT GCACCAC CTACACTCC CAGGTAGAGAT GAAC CATACAC CAAGCAGT T CACA 
GAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCTTCTACTTGGAAAAAAACAAAATGAATATGGACCA 
CCAGGCAAACTGTT T GT CCATGAGTGGGCTCACCT CCGGT GGGGAGT GT TTGAT GAGTACAAT GAAGATCAGCCT TT C 
TACCGTGCTAAGT CAAAAAAAATC GAAGCAAC AAG GTGTT CCGCAGGTAT CT CTGGTAGAAATAGAGTTTATAAGTGT 
CAAGGAGGCAGCT GTCTTAGTAGAGCATGCAGT^AT TGATT CTACAACAAAACTGTATGGAAAAGATT GT CAATT CTTT 
CCT GATAAAGTACAAACAGAAAAAGCATCCATAAT GT TTATGCAAAGTATTGATT CTGTTGTT GAATTTTGTAACGAA 
AAAAC CCATAAT CAAGAAGCTC CAAGC CTACAAAACATAAAGT GCAATTTTAGAAGTACATGG GAGGT GATTAGCAAT 
TCT GAGGATTTTAAAAACACCATACCCAT GGT GACACCACCTCCT CCACCTGTCTTCT CATT GCT GAAGATCAGT CAA 
AGAAT TGT GT GCTTAGTT CT TGATAAGT CTGGAAGCAT GGGGGGTAAGGACCGCCTAAAT CGAAT GAATCAAGCAGCA 
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
AATAAGCTAATCCAAATAAAAAGCAGT GATGAAAGAAACACACT CAT GGCAGGATTAC CTACATATCCT CTGGGAGGA 
ACTT C CATCT GCT CTGGAATTAAATAT GCATTTCAGGTGATT GGAGAGCTACAT TCC CAACT CGAT GGAT CCGAAGTA 
CT GCT GCTGACT GAT GGGGAGGATAACACT GCAAGTT CT T GTATTGATGAAGT GAAACAAAGT GGGGCCATT GTTCAT 
TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT 
TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
TCCCTTCAGCTCGAAAGTAAGGGATTAACACTGAATAGTAAT GCCTGGAT GAACGACACTGT CATAAT TGATAGTACA 
GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
ATAAT GGAAAATTT CACAGTGGAT GCAACTTC CAAAATGGCCTAT CTCAGTATT C CAGGAACT GCAAAGGTGGGCACT 
TGGGCATACAAT CTTCAAGC CAAAGCGAACCCAGAAACATTAACTATTACAGTAACTT CTCGAGCAGCAAATTCT TCT 
GT G C CTCCAATCACAGT GAATG CTAAAAT GAATAAGGACGTAAACAGTT TCC CCAGCCCAATGATTGTTTACGCAGAA 
ATTCTACAAGGATAT GTAC CTGTT CT T GGAGCCAATGTGACTGCT TTCATT GAATCACAGAATGGACATACAGAAGTT 
TTGGAACTTTTGGATAATGGTGCAGGCGCTGATTCTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATAT 
ACAGAAAAT GGCAGATATAGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACT GCCAGGCTAAAATTAC GGCCT C CA 
CT GAATAGAGCC GCGTACATACCAGGCTGGGTAGT GAACGGGGAAATTGAAGCAAAC CCGCCAAGACCT GAAATT GAT 
GAGGATACT CAGACCACCTTGGAGGATTT CAGCC GAACAGCATCCGGAGGT GCATTTGT G GTATCACAAGT CCCAAGC 
CT TCC CTTGCCT GAC CAATACCCAC CAAGT CAAAT CACAGACCTTGATGC CACAGTTCATGAGGATAAGAT TATTCTT 
AC^TGGACAGCACCAGGAGATAATTTTGATGTTGGAAAAGT^ 

GATCTAAGAGACAGTTTTGAT GATGCT CTTCAAGTAAATACTACT GATCTGT CACCAAAGGAGGCCAACTC CAAGGAA 
AGCT TT GCATT TAAACCAGAAAATATCTCAGAAGAAAAT GCAACCCACATAT TTAT TGCCATTAAAAGTATAGATAAA 
AGCAATTTGACATCAAAAGTAT CCAACATTG CACAAGTAACTTT GTT TAT C C CTCAAGCAAATCCTGAT GACAT T GAT 
CCTACAC CTAC T CCTACTCCTACTCCTACT CCTGATAAAAGTCATAATT CT G GAGT TAATATTTCTACGCTGGTATT G 
T CT GT GATTGGGTCT GTTGTAATT GT TAACTTTATTTTAAGTACCACCATTTG^C CTTAACGAAGAAAAAAAT CT T C 
AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGTAAAGGATATT TCT GAAT CTTAAAATT CATCC CAT 
GTGT GATCATAAACT CATAAAAATAATTTTAAGAT GT CGGAAAAGGATACTTTGATTAAATAAAAACACT CATGGATA 
T GTAAAAACT GT CAAGATTAAAATTTAATAGTTTCAT TTATTT GTTATT TTATT T GTAAGAAATAGTGATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAATTAAAAT CAT CTATCT GAGTAGT CAAAATACAAGTAAAGGAGAGCAAATAAACAACATTTGGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSECKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRS TWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFWSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
I TWNSLPPS I SLWDPSGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 
AANS SVPPITVNAKMNKDVNS FPS PMI VYAE ILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADS FKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYI PGWWNGE I EANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI I RI SAS I LDLRDS FDDALQVNTTDLS PKEANSKES FAFKPENI SEENATHI FIAI 

KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT C CTTAGGT GGAAACC CTGGGAGTAGAGTAC TGACAGCAAAGAC CGGGAAAGACCATACGT CCCCGGGCAGGGGTGA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAG CCACCT CCTAAC CTT CCCTT CTTGAACCCCCAGTTAT GCCAGGATTTACTAGAGAGT GTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTT G C CTGAGGGT G GACCAGAAGAAAGGAAAGGT CC CCT CT TGCT GTT GGCTGCACAT CAGGAA 
GGCTGTGAT GGGAAT GAAGGTGAAAACTT GGAGATTT CACTTCAGTCATT GCTTCTGC CT GCAAGAT CAT CCT TTAAA 
AGTAGAGAAGCT GCT CTGTGT GGT GGTTAACT C CAAGAGGCAGAACT C GTTCTAGAAGGAAATGGAT GCAAG CAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACT GCCCAGGGCCAACAGCC CCACG GGGAAGGAGGGGTACCAGGCCGT CCTTCAGGAGT GGGAGGAGCAGCAC 
CG CAACTACGTGAGCAGCCT GAAGC GGCAGAT CGCACAGCTCAAGGAGGAGCT GCAGGAGAGGAGT GAGCAGCT CAGG 
l*it AATGGGCAGTACCAAGCCAG CGATGCT GCTGGCCT GGGT CTGGACAGGAGCCC CCCAGAGAAAACCCAGG CCGACCT C 

hi CTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGCAGTG 
**? CCTT TCGATAGCTTTACT CTACAGAAGGT GTACCAGCT GGAGACT GGC CT TAC CC GC CAC CC CGAGGAGAAGCCT GTG 

Q AGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGC 
CCCAATCACCGT CCT TACACGGCCT CT GAT TT CATAGAAGGGATCTACCGAACAGAAAGGGACAAAGGGACAT T GTAT 
yi GAGCTCACCTTCAAAGGGGACCACAAACACGAATT CAAACGGCT CAT CTTATTT CGAC CATTCAGCCCCATCAT GAAA 

III GTGAAAAATGAAAAGCT CAACATGGCCAACACGCT TATCAATGT TAT C GT GCCT CTAGCAAAAAGGGT GGACAAGTT C 

*f3 C GGCAGTT CATGCAGAATTTCAGGGAGAT GTGCATTGAGCAGGATGGGAGAGTCCATCT CACTGTT GTTTACT T TGGG 

Z ? J AAAGAAGAAATAAAT GAAGT CAAAGGAATACTTGAAAACACTT C CAAAGCTGCCAACTT CAGGAACTTTACCTT CAT C 

IP CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
SA TT TTT CT GT GAT GT GGACAT CTACTTCACAT CTGAATTCCTCAATACGTGTAGGCT GAATACACAGC CAGGGAAGAAG 

^ GTATTT TAT CCAGTT CTTTT CAGTCAGTACAATC CTGGCATAATATACGGCCACCATGAT GCAGT CCCTC CCTT GGAA 

* CAGCAGCTG GTCATAAAGAAGGAAACTGGAT TTT GGAGAGACTTT GGAT TTGGGATGACGT GTCAGTATC GGTCAGAC 

f*| TTCATCAATATAGGT GGGTTT GATCT GGACATCAAAGGCT GGGGC GGAGAGGATGTGCACCT TTATC GCAAGTAT CT C 

hf CACAGCAACCTCATAGT GGTACGGACGCCT GT GCGAGGACTCTT CCACCTCT GGCAT GAGAAGCGCTGCAT GGACGAG 

CT GAC C CCCGAGCAGTACAAGATGT GCAT GCAGT CCAAGGC CATGAACGAGGCAT CCCAC GGCCAGCTGGGCAT GCTG 
f*% GTGT TCAGGCACGAGATAGAGGCTCAC CTT C GCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACTCCCAGA 

GAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAG^ 
ill GGACGACAAAAGAAT T GGACT GATGGGTCAGAGATGAGAAAGCCT CCGATTT CTCTCTGT TG GGCTTTTTACAACAGA 

AATC7VAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
if AGATTATAAGCCTAAT GGTGT GGAGGTTTT GATGGTGTTTACAATACACT GAGAC CTGTT GTTT T GT GTGCT CATTGA 

fU AATATTCAT GATTTAAGAGCAGTTTT GTAAAAAAT T CATTAGCAT GAAAGGCAAGCATATT T CT CCT CATATGAATGA 

GCCTATCAGCAGGGCT CTAGT TT CTAGGAATGCTAAAATAT CAGAAGGCAGGAGAGGAGATAGGCT TATTAT GATACT 
AGTGAGTACAT TAAGTAAAATAAAAT GGACCAGAAAAGAAAAGAAACCATAAATAT C GT GT CATAT TTT C CCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC 
ATTTTTATATT TTT TAAGAAGATACTTTGAGATGCATTATGAGAACT TT CAGTT CAAAGCAT CAAATT GATGCCATAT 
CCAAGGACATGCCAAAT GCTGATT CTGT CAGGCACT GAATGTCAGGCAT T GAGACATAGGGAAGGAAT GGTTT GTACT 
AATACAGACGTACAGATACTT TCTCTGAAGAGTATTT T C GAAGAGGAGCAACT GAACACT GGAGGAAAAGAAAAT GAC 
ACTT TCT GCTTTACAGAAAAGGAAACT CATTCAGACT GGTGAT ATCGTGAT GTAC CTAAAAGT CAGAAAC CACATTTT 
CT CCTCAGAAGTAGGGACC GCTTT CTTAC CT GTT TAAATAAACCAAAGTATACCGT GT GAAC CAAACAATCTCT TTTC 
AAAACAGGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATGGAGAAAAATATATATATATATATATATATTGT 
GAAAGAT CAAT C CAT CT GCCAGAATCTAGT GGGATGGAAGTTTT TGCTACAT GT TATCCACC CCAGGCCAGGT G GAAG 
TAACTGAATTATTT TTTAAAT TAAGCAGTTCTACTCAAT CAC CAAGATGCTT CT GAAAATT GCATTTTATTACCATTT 
CAAACTATTTTTTAAAAATAAATACAGT TAACATAGAGT GGTT T CTTCATTCAT GTGAAAATTATTAGC CAGCACCAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCAC^GTAAACTCATTGTTTAAAAGCTTCAAGAAC 
ATT CAAGCT GT TGGTGT GTTAAAAAAT GCATT GTATT GATTTGTACT GGTAGTTTATGAAATTTAATTAAAACACAGG 
CCAT GAAT GGAAGGT GGTATT GCACAGCTAATAAAATAT GATTT GT GGATAT GAA 
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FIGURE 72 



MMMVRRGLLAWISRVVVLLVLLCCAI SVLYMLACTPKGDEEQLALPRAiSrSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENS PNHRPYTASDFIEG I YRTERDKGTLYELT FKGDHKHE FKRL I L FRPFS P IMKVKNEKLNMAN 
TLIOTIVPIAKRVDKFRQFMQNFREMCIEQ 

TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
K YLHSNL I WRT P VRGL FHL WHE KRCMDE LT PEQYKMCMQSKAMNE AS HGQLGML VFRHE I E AHL 
RKQKQKTSSKKT 



HI Important features: 

pi Signal peptide: 

'H amino acids 1-27 

tZ N-glycosylation sites. 

Ill 

$*% amino acids 315-319, 324-328 

CI N-myristoylation sites. 

Wi 

ammo acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 



Amidation site. 

amino acids 377-381 
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FIGURE 7* 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACT GGACGTAT GAGGGCCCACAT GGT CAGGACCAT T GGCCAGCC T CT TACCCTGAGT GT 
GGAAACAAT GCCCAGT CGCC CATCGATAT TCAGACAGACAGT GTGACAT T TGACCC TGAT T TGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACAT TTGCAGAGCTCCACATTGTACAT TATGACTCTGAT TCCTATGACAGCTTGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCT TATGAACACAT TCTGAGTCACT TGCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATT TCAATGGAACAGCT GGAAAAGC T TCAGGGGACATT GT TC TCCACAGAAGAGGAGCCC TC TAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGT CAT T CCAGGAAGAAC TGCAGAGCCT TCAGCCT C TC CAAACATGTAGGAGGAA 
ATGAGGAAAT CGC TGTGTT GT TAATGCAGAGANCAAAC T CT GT T TAGT T GCAGGGGAAGT TT GGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCT TAGGATAAAGAGTT GC TGTTGAAGT T GTATAT TT TT GAT CAATATAT T TGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRiytVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCT GGT GGT TT GCC TAAACC TGCAAACATCACC T TC TTAT CCATCAACAT GAAG AATG TCC TACA 
ATGGACTCCACCAGAGGGTC T T CAAGGAGT TAAAGT TACT TACACT GTGCAGTATT TCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCT T CCTGT T TCCATGCAACAAATATACT CCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
* AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
% CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
fff TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
||| GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
111 AC T T TATCACCCTCAATATCTCGGATGAT TCTAAAAT T T CTCATCAGGATATGAGT T TACTGGGA 

%f : AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
Q AAAACACGGAAGGTAC T TCTCT CACCCAGCAAGAGT CCC T CAGCAGAACAATACCCCCGGATAAA 

ACAGT CATT GAATAT GAATATGAT GTCAGAACCAC T GACATT TGTGCGGGGCC TGAAGAGCAGGA 
GCT CAGT TTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 

is! 

CAGGAGCACACAGAC TCGGAGGAGGGGC CGGAGGAAGAGCCAT CGACGACCC TGGT CGAC TGGGA 

H? 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA 
TGT GCAGATGGAAAACTGATGCCAACAC T TCC T T T TGCC TT TT GT T TCCTGT GCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 



-■■ ss: 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCS I SSQI GPPEVALTTDEKS I S WLTAPEKWKRNPEDLPVSMQQI Y 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKI I FWYVLP IS I TVFLFSVMGYS I YRYIHVGKEKHPANL ILI YGNE FDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCC TCAGTCCCAC T GCAGTTCT CATCC T CGGCCCAAAAGT CATCAAAGAAAAGC TGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAATGAAAAGGCTGCAGATAAGCT GGGATC TACCCAGAT CGTGAAGATCCTAAC TCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TT TACACCAAAGG TGACCAACTTATACTCAAC TT GAATAACATCAGCT CTGAT CGGATCCAGC T G 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 

GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 



MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLWTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVI E AS FNGMYADLLQLVKVP I SLS I DRLE FDLLYPAIKGDT I QL YLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 

Id, KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 

O FEAAE S S LTKDALVL TPAS LWKPS S PVS Q 

O 

J I Important features of* the protein: 

%n Signal peptide: 

ffl amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 



:i 7 



Q 

|i| Glycosaminoglycan attachment site. 

W amino acids 412-415 

is; :! 

Ill 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGT CAGCCTGGCAGAGAGACT CTGAAATGAGGGAT TAGAGGTGT TCAAGGAGCAAGAGC 
T TCAGCC T GAAGACAAGGGAGCAGT CCC T GAAGACGC TTC TAC T GAGAGGT CT GCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCT CTGGATGGAATGTGCCACACACAGCACAGGCAT CACCCAGT GT GACAT C TATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTC T TGCCACAAGGAGC T CTCCAAGGCC TGGTCAACCTC CCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GTCAGAAGGTGCTGCTGAGGATAGACT GACTTTGGCCAT TGGATTGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAAT TT GTGCATGAAC TGAAATA7\AACCATCC TACGGTAT CCAGGGAACAGAAAGCAGGATG 
CAGGAT GGGAGGACAGGAAGGCAGCCT GGGACAT T TAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YS TLLGLPADI QAAQAMMVTS SAI S SLACI I SWGMRCTVFCQESRAKDRVAVAGGVFFILGGL 
LGFI PVAWNLHGILRDFYS PLVPDSMKFE IGEALYLGI IS SLFSLIAGI I LCFSCS SQRNRSNYY 
DAYQAQPLATRS S PRPGQPPKVKSE FNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 

CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
T CCCT CT GGAC TCCGGGCCCCACCAAAGT CC TGCCAGCACAACGGGACCAT GTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACT GCCAGACTCCTGCTGCCAAGCC T GCAAAGATGAGGCAAGT GAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACT TGGTGGAGATC TACCT C TGGAAGCTGGTAAAAGAT GAGGAAAC TGAGGC TCAGAGAG 
GT GAAGTACC T GGCCCAAGGCCACACAGCCAGAATC T TCCAC T TGACTCAGAT CAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGACC TA&CAGTTGCAGAT ATGAGC TGTATAATT GT T G TTAT TATATAT TAATAAA 
TAAGAAGT T GCAT TACCC TCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 



GACAGCTGT GTC TCGATGGAGTAGAC TCT C AGAACAGCGC AGTT TGCCCTCCGC TCACGC AGAGCCTC TCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TC CGT T TCCATGCCGTGAGGTCCATT CAC AGAAC AC ATCCATGGCT CT CATGCT CAGT TTGGT T CTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGCAGGAC AAAACTGGTG AAGG AT T C TATT GC GGAGGGGCGC ATCTCT CT GAGGCT GGAAAACATTAC T 
GT GTTGGAT GCT GGCCTCT ATGGGTGCAGGAT T AGTTCCC AGTC TT ACT ACC AGAAGGCC AT CTGGGAGC T 
AC AGGTGT CAGCACTGGGCT CAGT T C CT CT CAT TTCCATC ACGGGATATGTT GATAGAGACATCCAGCTAC 
TCTGT C AG T CCT CGGGCT GGTTCCCC CGGCCC ACAGCGAAGT GGAAAGGT CC AC AAGGACAGGATTTGTC C 
ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
CGCCGGGAGCAT AT CCTGTT CC AT GCGGCATGCT CATCTGAGCC GAGAGGTGGAAT CC AGGGT ACAGATAG 
GAGAT ACCTTTT TC GAGCCT AT AT CGTGGC ACCTGGCT ACCAAAGT ACTGGGAATACT CTGCTGTGGC CT A 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCT CACCCGAAGCT C TGCGTT TCTGAT C TGAAAACT GTAACCC AT AGAAAAGCTCCCC AGG AGGT G 
CC TCACT C T GAGAAGAGATT T ACAAGGAAGAGTGTGGT GGCTTCTCAGAGTT TCCAAGCAGGGAAACATT A 
CT GGGAGGTGGACGGAGG ACACAAT AAAAGGT GGCGCGTGGGAGTGTGCC GGGATGAT GTGGAC AGGAGGA 
AGGAGT ACGT GACT T T GT CT CCCGAT CAT GGGTACTGGGTCCTC AGACTGAATGGAGAAC ATTT GT AT T T C 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACT^AAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CC AGT C ACCC AGGAAT CAGAGAAAGAGGCCTC TT GGCAAAGGGCCT CTGC AATCCCAG AGAC AAGC AACAG 
TGAGTCCT CC TCAC AGGCAACCACGC CCTTCCTCCCC AGGGGTGAAAT G TAGG ATGAAT C ACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GAT GAAGGGGGACTGGCCTGTCCACATGGGAGTC AGGT GTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACCGTCAGG AATT CCC ATC TCACAGGCTGTGGT GT AG AT T AAGTAGAC AAGGAAT GT GAAT AATGCTT AG 
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 
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FIGURE 8a 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDS IAEGRI SLRLENI TVLDAGLYGCRI SSQS YYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WASQS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKE YVTLS PDHGYWVLRLNGEHL YFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
P I VI CPVTQESEKEAS WQRASAI PE TSNSES S SQAT TP FL PRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8* 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGC GT GGGAGATACGGGCATAGAGGAT GCAAACGC T GTCAGGGGT T C 
AGCCTCTCAGGGGCCCCTGACTGAAGCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACT GCAGAGAC TCACCC TGAT TGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCT GAT TC T T GTAGAAT TAACAGCCCT CAA.CGTGATGAGC TATGATAACAC TATGAAT TATG 
T GCAGAGTGAAAA.GCACACAGGCT T TAGAGTCAAAGTAT CTCAAACCTGAAT CCACACT GTGCCC 
T CCCT TT TAT T T T T T TAAC TAAAAGACAGACAAA.T TCCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRS DAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGS QQVYLNVSLQSKATS GVTQGWGGAGATALVFLS FCVI FVWRS CRKKSARPAAGVGDTGIE 
DTySFAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGAC TCAGACAAGAT TACAATGAACCAAC T CAGCT TCCTGCTGTTTC 
T CATAGCGACCACCAGAGGATGGAGTACAGAT GAGGC TAATACT TACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACC T T TGGAT C T GCAGAGG C GGC C ACGAGC GAT GAC TACAAGAACCC T GGC T AC T 
p ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
if! AGAAACAGCT CCC TGCTGAGGTACCGCACGGACAC TGGC T TCCT CCAGACACTGGGACATAATCT 

W GT T T GGCATCTACCAGAAATAT CCAGTGAAATATGGAGAAGGAAAGTGTT GGAC TGACAACGGCC 

CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAAT TCAC T GCGGGATT TGT TCAGT TCAGGGTAT T TAATAACGAGAGAGCAGCCAA 
s CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
O GAT ACT T TCCAGAGGCCAGTCCCCAGCAGTGT GGAGAT TTTTCTGGTTTT GAT TGGAGTGGATAT 

Wl GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
pl T TGAGAGT TT TGTGGGAGGGAACCCAGACCT C TCC TCCCAACCAT GAGAT CCCAAGGATGGAGAA 

lZ CAACTTACCCAGTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGA 
ri| AAAAAA 

3 T-;i r 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQT FCDMT S GGGGWTLVAS VHENDMRGKCT VGDRWS S QQGS KAD YPE GDGNWAN YNT FGSAEAAT 

SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
TTAAGACTTATATACAGTTTTAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE oo 



MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 

VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 



Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 

U 

O 
O 
If! 
W 
III 
CP 
H 

O 
US 
O 
ill 
O 

ru 
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FIGURE qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGA^AACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
JT CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
g AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
CO GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
W CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
jjj TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
* ^GGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
O AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
|j AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
|=; AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
p TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
f|| CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWG I LGQPPNRPNHS PPPS AKVKKI FGWGDFYSNI KTVALNLLVTGKI VDHGNGT F 
SVHFQHNATGQGNI S ISLVPPSKA.VEFHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyltransferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCT GCTGATC T T TGGAGCGT T T GT CTCT GTC TATATCCAAGAA 
ATGTTCCGAT T TGCATAT TATAAACTCTTAAAAAAAGC CAGTGAAGGT T T GAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCT GAAAC TC TGCC TGCTCTGCCAAGACAAGAAC TT TCT T CT T TACAACCAGCGC TCCAGAT 
AACCTCAGGGAACCAGCACTTCCCAAACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCT 
T TT T CT GAAAATCCC TT T T TCTGGTGGAATT GAGAAAGAAATAAAACTAT GCAGATA 
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FIGURE Q J. 

MTAAVFFGCAFIAFGPALALYVFTIAIEPLRI I FLIAGAFFWLVSLLISSLVWFMARVI IDNKDG 
PTQKYLLI FGAFVSVYIQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKPCWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 95 

AAT T T TT CACCAGAGTAAACTT GAGAAACCAACTGGACC T TGAGTAT TGTACAT TT TGCCTCGT G 
GACCCAAAGGTAGCAATC T GAAACATGAGGAGTACGAT TC TACT GT T T TGT CTTC TAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTGATACCATTAACA 
CAGAT GC TCACAC TGGGGCCAGAT C T GCAT CTGT TAAATCC TGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATT GACCCTGGGAGGGT T GAATGTACAACAGCAACT GCACCCACATGTGT TAC 
CAAT TT T T GT CACACAAC T TGGAGCCCAGGGCACTATCCTAAGC TCAGAGGAAT TGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
T AAT CCAGAT GTCCAGGAT GGAAGCCT T CCAGCAGGAGGAGCAGGT GTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACAT GCCATCGAGGAAGCCACCACAGAAT CAGCAAATGGAAT TCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ATTGAT TAT AT TATGGAATAGATT GAGACACAT TGGATAGTCTTAGAAGAAATTAAT TCT TAAT T 
TACC TGAAAATAT TCT TGAAAT TTCAGAAAATATGT TC TATGTAGAGAAT CCCAACT T T TAAAAA 
CAATAATTCAATGGATAAATCTGT C TT T GAAATATAACAT TAT GCTGCCT GGATGATATGCATAT 
TAAAACATAT T TGGAAAAC TGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQI FT S L I IHSL FPGG I L PTS QAGANPDVQDGS L PAGGAGVNPATQGTPAGRLPT PSG 
T DDD FAVT TPAG I QRS THAI EEAT TE S ANG I Q 



Signal peptide: 

amino acids 1-16 
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FIGURE Q7 

GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGC TACAGGT CACCC TGGAGATGCAGGATGGACAT GT CT TGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGAT GAGCCAGGCACAGCCAACTCGGAT CTTCGAT TCCACAT CC T GAGCCAGGC TCCAGCCCAGC 
H CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
H ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
X CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
!:f CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
Cfi CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
ly TGCAGAGGGAAACC TC TACGTGACCAGAGAGC TGGACAGAGAAGC CCAGGCT GAGTACC T GCTCC 

III AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
||| ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
^1 ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
; CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
fl CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
O CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
f|| GCC TGGATT GGGAGCCAGAC TCT GGGCATGT TAGAC TCAGACT CT GCAAGAACC TCAGT TATGAG 

CI GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
fgg AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
%U ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCC CT CCGACCCCAT CAGCCGAACCC TCAGGT T C TCCCTAGT CAAT GACT CAGAGGGC TGGC T 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T TCACCCAC TGGAC CATGTCAAGGAAGAAGGACCCGGAT CAACCAGCAGACAGCGTGCCCC T 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
T CCCCT GGGAGAGAGCC CAGCACCCAAGAT CCAGCAGGGGACAGGACAGAGTAGAAGCCCC TCCA 
T CT GCCCT GGGGTGGAGGCACCATCAC CAT CACCAGGCATGT CT GCAGAGCCTGGACAC CAAC T T 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE Q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S T SLDHALERT YQLL VQVKDMGDQAS GHQATATVEVS 1 1 E S TWVSLE P I HLAENLKVL YPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPT SGSVTLGVLPLRAGQNI LLLVLAMDLAGAEGGFS S TCE VE VAVTDI NDHAPE FI TS Q I G 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVWWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHI I PVWSHNAQl^QLLVRVI VCRCNVEGQCMRKVGRMKGMPTKLSAVG I LVGTLVAIG I FLI 
LI FTHWTMSRKKDPDQPADS VPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGAT TCT TCCCAGGAACACAAACGTAGGAGACCCACGC TCCTGGAAGCACCAGCCT TTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATC TAGGACCCAGGCAT CTTGCT T TCCAGCCACAAAGAGACAGATGAAGAT GCAGAAAGGAAAT G 
TTC T CCT TAT GT T TGGTC TACTAT TGCAT T TAGAAGC TGCAACAAAT TC CAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGT CACCAAC TCTGAGT T CCAT ACAACC TCCAGTGGGAT CAGCACAGCCACCAACTC TGAG 
T T CAGCACAGCGTCCAGT GGGATGAGCATAGCCACCAAC T C TGAGT CCAGCACAACC TCCAGTGG 
GGCCAGCACAGCCACCAAC TCT GAGT CCAGCACACCC TCCAGT GGGGCCAGCACAGTCACCAAC T 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AG TAGGGCCAGCAC TGC CACCAAC TC TGAGT C TAGCACAC TCT CCAGTGGGGCCAGCACAGCCAC 
CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAACTC TGAGTCCAGCACAACC TCCAGT GGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
CT GAGT CCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAAT TCT GAGT CCAGCACACCCTCC 
AGTGGGGCCAACACAGCCACCAACT CT GAGT CCAGTACGACCT CCAGTGGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG 
CACAGT GTCCAGTGGGATCAGCACAGTCACCAAT T CTGAGTCCAGCACAAC CTCCAGTGGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGT CCTAACTGGT TCT GGAGGAGACCAGTATCAT CGATAGCCAT GGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATC T TGAAGAAGGTAT TCC TCACCTT TC T T GCC T T TACCAGACAC T GGAAAGAGAATACTATAT 
T G C T CAT T T AGCTAAGAAATAAAT ACAT C T CAT C TAACACACACGACAAAG AGAAGC TGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGOTLLMFGLLLHLEAATNSNETSTSANTGS SVIS SGASTATNSGSS VTS SGVS TAT I SGS 
SYTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGL FAGLFFCVRNSLSLRNT FNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWS PNWFWRRPVS S I 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCT GGAACTATCAGGC TGAAAAACAGAGT GGGTAC TCT CT T CT GGGAAGC TGGCAACAAAT 
GGAT GAT GTGATATATGCAT TCCAGGGGAAGGGAAAT TGTGGTGCT TCT GAACCCAT GGTCAAT T 
AACGAGGCAGTTTC TAGC TACT GCACGTACT TCATAAAGCAGGACT C TAAAAGC T T TGGAATCAT 
GGTGTCAT GGAAAGGGAT T TAC TT TATAC TGAC TCTGTTT TGGGGAAGC TT TT T TGGAAGCAT T T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT T ATAACT GGGGATGCAT T T GT TCCT GGAGAAAGAAGTGTCAT TAT CAT GAACCAT CGGACAA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
C T T CAT T CATAGGAAAT GGAAGGAT GACAAGAGCCATT TCGAAGACATGAT TGATTAC TT TTGTG 
ATAT T CACGAACCAC TTCAACTCC T CATATT CCCAGAAGGGACTGAT CTCACAGAAAACAGCAAG 
TCTCGAAGTAATGCATT TGCTGAAAAAAATGGACT TCAGAAATATGAATATGTT TTACATCCAAG 
AACTACAGGC T TTAC TT TTGTGGTAGACCGT C TAAGAGAAGGTAAGAACC T TGATGCT GT CCAT G 
ATATCAC TGTGGCGTAT CC TCACAACATT CCTCAAT CAGAGAAGCACCT CCTCCAAGGAGACTT T 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAAT T T T TAT T T TACCGGACAGAGT GTCATTCCAC CT TGCAAGTCTGAAC TCAGGGT C 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATAT TTGTACAGTC T T GT TAAGT GGT AT TT TATAAT CACCAT T GTAAT C T T TGTGC TGCAAGAGA 
GAATAT T T GGT GGACTGGAGAT CATAGAACT T GCAT GT TACCGACT T T TACACAAACAGCCACAT 
TTAAAT TCAAAGAAAAATGA GTAA GATTATAAGGTTTGCCATGTGAAAACCTAGAGCATAT TTTG 
GAAATGT TCTAAACCTT TC TAAGCT CAGATGCAT T T T TGCATGACTATGT CGAATAT T TC T TAC T 
GC CAT CAT TAT T T G T TAAAGATAT T T T GCAC T T AAT T T T GT G G GAAAAAT AT T GCT ACAAT T T T T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GG G C C AG AAT AT TAT T AAAC AAT CAT C AG G C T T T T AAA 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWIlSfNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE IP* 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGC T GTCAT CTGTTT TCATAGT GTGAGAT CAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCGCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACT GGGAAT CTAAGCAGATGCCACAGTAT CGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAAT CCAAAGGGAAAATCCAGGCGGAACT GGAC TGGAGAAGAAAGCACGGACAGGCAGAAT TGAG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
TT T CTGATCTGAAAACTG TAAC CCATAGAAAAGC TCCCCAGGAGGT GCC TCAC TC TGAGAAGAGA 
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
TGACAT GTCAG T T T GAAGGCT T GT T GAGACCCTATATCCAGCATGCGAT GTAT GACGAGGAAAAG 

GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGC TCCCGACAGGTGGCCCCAGCT TCCTCT 

CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGT GAT T CCTGCCTCACAGGT GAAGAT TAAAGAGACAACGAAT GT GAATCATGC TTGCAGGT T 
TGAGGGCACAGTGT TTGC TAATGAT GT GTT T TTATAT TATACAT T T TCCCAC CATAAAC TC TGTT 
TGCTTATTCCACAT TAAT T TACT T T TC T CTATAC CAAATCACCCAT GGAAT AGT TAT TGAACACC 
TGCTT T GTGAGGC T CAAAGAATAAAGAGGAGGTAGGAT T T T T CACT GAT T CTATAAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
CTCATTAACACAGACACAAAAAT TCTAAATAAAAT TT TAACAAAT TAAACTAAACAATATATTTA 
AAGATGATATATAACTACTCAGT GTGGT T TGTC CCACAAATGCAGAGT T GGT TTAATAT T TAAAT 
AT CAACCAGTGTAAT TCAGCACAT TAATAAAGTAAAAAAGAAAAC CATAAAAAAAAAAAAAAA 



ID=1 0063567 



Page 255 



FIGURE iQA 



MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHIAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WAS QG FQAGRHYWEVDVGQNVGW YVGVCRDDVDRGKNNVT L S PNNG YWVLRL T TE HLYFT 

FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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CCTT C ACAGGAC TCTT CATT GC TGGTTGGC AATGATGT ATCGGC CAGATGTGGT GAGGGC TAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
CAAT GGTGAAAAAT GC ATTT T ATAAATCT C CATTAAGGGAAGAATTTGTCAAGT CT CAGGTT ATCAAGTT C 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
- . ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
fs* ACACGAAGAAGTAAAACTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAGAAGTAGAAGAGGGTGAATG 
p= GCCCTGGCAGGC T AGC CT GC AGTGGGAT GGGAGTCATC GCTGTGGAGCAACCTT AAT T AATGCC ACAT GGC 

|fl TTGT GAGTGCTGCT CACT GT T T TACAAC AT AT AAGAAC CCTGCC AGATGG ACT GCT T CCT TTGGAGT AAC A 

|l| ATAAAACCTTCGAAAATGAAAC GGGGTCTCCGGAGAAT AATTGT CCAT GAAAAATACAAACACCCATC AC A 

TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 

^ S TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 

%l 

GATGGT TACAGT C AAAATCAT C TT CGAC AAGC ACAGGT GACTCTCATAGACGCT AC AACTTGCAATGAAC C 
^ TCAAGC TTAC AATGACGCCATAACTCCT AGAATGT T AT GTGCTGGCTCCTT AGAAGGAAAAAC AGATGCAT 

5 « GCCAGGGTGACT CT GGAGGACC ACTGGT T AGTT CAGAT GCT AGAGATATCTGGT ACCTTGCT GGAAT AGT G 

AGCT GGGGAGAT GAAT GTGC GAAACCC AAC AAGCCT GGTGTT TATACT AGAGTT ACGGCCTT GCGGGACT G 
fl % GATT AC TTCAAAAACT GGT ATC TAA GAG AC AAAAGCCT CATGGAAC AGAT AACATT TTTTTTTGTTTTTTG 

O GGTGTGGAGGCC AT TT TT AGAGAT ACAGAATT GGAGAAGACT TGC AAAAC AGCT AGATTTGACTGATC TC A 

fl| ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 

GATC AACT CT GT CATC TGT GAG CAAT AGT T GAAACT TT ATGT AC AT AGAGAAAT AGATAATACAAT AT TAC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGT AT TAGGT G TTTT TCT T AGTGGAAT AT TAGAAATGATCAT ATTCATT ATGAAAGGT C AAGC AAAGAC A 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
ATAATAATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGCAAATATTTATTTAACATTG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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MMYRPDWRARKRVCWEP WVI GLVI FI S L I VLAVC I GLTVHYVRYNQKKT YNYYS TLS FTTDKLY 
AEFGREASIMFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTAS FGVT IKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT TCNE PQAYNDAI TPRMLCAGSLEGKTDACQGDSGGPLVS SDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGT TAATGTCCATCAGT GT GT TGGCAGT T TCT GCT T GGAT GAGGGAC TACC TAA 
ATAATGT TCT CACT T TAAC TGCAGAAACGAGGGTAGAGGAAGCAGT CATT T TGAC T TACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GAT AT GGTCAC T T TGAAAGCCAGGAT GACAAAT TATGGAT TACCT AGATATCGGTGGC T TAC TCA 
jU TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
|=| AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
Zli CAGGCCCACCAGGAAGATC TCAGTGACCT T TATCAAGAGGGT T GTGGGAAGAAAATGTAT TC CT T 

!:f T TT GAGAGGAACCAAACAAC TGCAGGTGCT GAGGT T T CTGGGAATCTCCAT TGGGGTGACACAAA 

IP TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
|tf ACAGACCAAATGATGTCC T TGAAGAATGACAACTCTCAGCACCT GTCATGT CCC TCAGTAGAAC T 

| fl GTT GAAACCAAGCCTGTCAAGAATCT TTGAACACACATCCATGGCAAACAGCT TTAATACACACT 

71 TTGAGAT GGAGGAGT TATAAAAAGAAAT GT CACAGAAGAAAACCACAAACT TGT T TTATTGGACT 

J 1 1 TGTGAAT T TT TGAGTACATACTATGT GT T T CAGAAATATGTAGAAATAAAAATGT TGCCATAAAA 

TAACACCTAAGCATATAC TATT CTAT GC TT TAAAAT GAGGATGGAAAAGT T T CATGTCATAAGTC 
i ACCACCT GGACAATAATTGATGCCCT TAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 

J! TGTGTAT GACT T TTAC TGAACACAGT TATGTT TTGAGGCAGCAT GGTT TGAT TAGCAT T TCCGCA 

|s» TCCATGCAAACGAGTCACATATGGTGGGACTGGAGCCATAGTAAAGGT TGATTTACT TCTACCAA 

%! CTAGTATATAAAGTACTAATTAAATGCTAACATAGGAAGTTAGAAAATACTAATAACTTTTATTA 
O CTCAGCGATC TATTCT T C TGAT GCTAAATAAAT TATATAT CAGAAAACTT TCAATAT TGGTGACT 

f|| ACCTAAAT GT GATT T T TGCTGGT TACTAAAATAT TCT TACCACT T AAAAGAGCAAGC TAACACAT 

O TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
f|3 T TCAGT T C TGATAATGT TAAGAATAACCAT TATGAAAAGGAAAAT T T G TCCTGTATAGCATCAT T 

ATT T T TAGCCT T TCC TGT TAATAAAGCT TTAC TAT TCTGTCC T GGGCT TATAT TACACATATAAC 
T GTTAT T TAAATACT TAACCAC TAAT TT T GAAAAT TACCAGTGTGATACATAGGAATCAT TATTC 
AGAAT GTAGT C T GG T C T T TAGGAAGTAT TAATAAGAAAAT T TGCACATAACT TAGT TGATTCAGA 
AAGGACT T GTATGC TGTT T T TC TCCCAAAT GAAGAC TCT T T TT GACACT AAACACT T T T TAAAAA 
GCT TAT CT TTGCCT T C TCCAAACAAGAAGCAATAGTC TCCAAGT CAATATAAAT TC TACAGAAAA 
T AGT GT T CTTT T TC T CCAGAAAAAT GCT T GTGAGAAT CAT TAAAACATGT GACAAT T TAGAGAT T 
CTTTGTTT TAT T TCAC TGAT TAAT AT AC T GTGGC AAATTACACAGAT TAT TAAAT t T T T T TACAA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
T TCT CAGAATAT GGAAAGAAAAT TAAAAT GTGTCAATAAAT AT TTTC TAGAGAG TAA 
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MMIEDSWCLRCLLYALNLLFWLMSISVIA^ 

VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGWTYEQELMVPVQWSDMVT 
LPCARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLS DLYQEGCGKKMYS FLRGTKQLQVLRFLG I S I GVTQI LflMI LT I TLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRI FEHTSMANS FNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGTCT T TCT C GT GAGAGC CT AGAGGCCTT AAAAAAAAAAGT GCT T GAAAGAGAAGGGGAC AAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAACACTGACCACCAGTT GGATGAGTCT CAAGGTCCTCCT CTATGTGACAACC ATGT GAAT GGGGAG 
TGGTACCACTTCACGGGCAT GGC GGGAGATGCCAT GCCT ACCTTCT GC AT ACC AGAAAACCACT GTGGAACCCA 
CGC ACCT GTCT GGCTCAAT GGCAGCCACCCCCTAGAAGGCGAC GGCATTGT GC AACGCCAGGCTTGTGCCAGCT 
TCAATGGGAAC TGCT GTCT CTGGAACACC AC GGT GGAAGTC AAGGCT T GCCCT GGAGGCT ACTAT GTGTATCGT 
CT GACC AAGCC CAGC GTCT GC TT CC AC GT C T ACT GT GGTC AT T T T T AT GACAT CT GC GAC GAGGACT GCCAT GG 
C AGC T GCT CAGAT ACC AGC GAGT GC AC AT GC GCT C C AGGAACT GT GCT AGGC CCT G ACAGGC AGACAT GCT T T G 
ATGAAAATGAATGTGAGC AAAAC AACGGT GGCT GCAGT GAGATCT GTGTGAACCTCAAAAACTCCTACCGCT GT 
GAGT GT GGGGT TGGC C GT GT GCT AAGAAGT GAT GGC AAGAC TT GT GAAGACGT T GAAGGAT GCC ACAAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CT GAGGAT AACCACACTT GCCAAGTCCCT GT GT T GT GC AAATC AAAT GCCAT T GAAGT GAACAT C CC C AGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCT CTTCTCTCTCAAGAC AT GT GGT AC AGT GGTC GAT GTGGT GAAT GACAAGATT GT GGCCAGC AACCTCGT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACC T GCGAGT T TCCAC GCC T GT ACACC AT T TCT GAAGGAT AC GT T C CC AAC C T T C G AAAC TCCC C AC T GGAAAT 
CAT GAGCC GAAAT CAT GGGAT CT TCCC AT T C ACT C T GGAGATC T T C AAGGAC AAT G AGTT T GAAGAGCCTT ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TT GGAAAGCTT GGTGGAGAGCTGCTTT GCCACCCC CACCTCCAAGATC GACGAGGTCCTGAAAT ACTACCTCAT 
CC GGGAT GGCT GT GT TTCAGATG ACT C GGT AAAGC AGT AC ACATCCC GGGAT C ACCT AGC AAAGCACT TCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TT GGACGAGC GTT CC CGCT GT GC CC AGGGT T GCCACC GGC GAAT GC GT C GT GGGGC AGGAGGAGAGGACT CAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGT T C AGAC TT CAC AC T GT GAGT T C AGAC TCCC AGC AC C AACT CAC TCT GAT TC T GGT CC AT TCAGT GGGC A 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAAT CAGAAGC T GGGT AT AAT AT TT C AAGTT AC AAAC CC T AGAAAAAT T AAAC AGT TACT GAAATT AT GA 
CT T AAAT ACCC AAT GACT C CT T AAAT AT GT AAAT T AT AGT TAT AC CT T GAAAT T T C AAT T C AAAT GC AGACT AA 
TT ATAGG GAAT T T G GAAGT GTAT CAAT AAAACAGT AT AT AATT TT 
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FIGURE no 



pa 



MPPFLLLTCLFITGTSVSPVMjDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHWGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTVVDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 



Important features of the protein: 
Signal peptide: 

f>s amino acids 1-16 



N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 



f I* 

N-myristoylation sites. 

fj| amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 

522-528, 531-537 

Aspartic acid and asparagine hydroxy lati on site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGGCAGC AGCTTGCT CAGC GGAC AAGGAT GC TGGGCGTG AGGGAC CAAGGCCTGCCCTGCACT CGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
TACAGG AT CCTGAC AGTGAT C AACCTCT GAAC AGCC TCGATGTC AAACCCCT GCGC AAACCCCGTAT CCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCT C ATCAAGGTGATTCT GGAT AAAT ACT ACTT CCTCT GC GGGC AG CC TCTCCACTTC ATCC CGA 
GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
.1^ AGGGAACT GGTT CT CTGCCT GTTT CGAC AACTTC AC AGAAGCTCTCGCTGAG AC AGCCT GTAGGCAGATGG 

u 

-5 =% GC TACAGC AGAGCT GTGGAG ATTGGCCCAGACCAGG AT CT GGAT GT TGTTGAAATCACAGAAAACAGC CAG 

gp GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 

l|j TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 

Ifl TC AGCATCCAGT ACGACAAACAGCACGTCT GT GGAGGGAGC ATC CTGGACCCCC AC TGGGTC CTCACGGCA 

Cfl GCCCAC TGCTTC AGGAAACATACCGATGTGTT CAACTGGAAGGT GCGGGCAGGCTC AGAC AAACTGGGCAG 

^ CTTCCC ATCCCT GGCTGTGGCCAAGATC AT CATC AT TG AAT TCAACCCCATGT AC C CCAAAGAC AATGAC A 

* TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 

2« GATGTCTGAC AT ACTGCT GC AGGC GTCAGTCCAGGT CATTGACAGCAC ACGGTGCAAT GC AG ACGAT GCGT 

w 

ACCAGGGGGAAGTC ACCGAGAAGATGATGT GT GC AGGCATCCCG GAAGGG GGTGTGGACACCTGCC AGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
HI CTGCGGGGGCCCGAGC ACCC CAGGAGTATACACCAAGGTCTC AGCCT AT C TC AACT GGAT CT AC AAT GTCT 

GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCC C AAAGTCAGACACAGAGCAAGAGTC CCCTTGGGT ACACCC CT CT GCCCAC AGCCT CAGCAT 
TT CTTGGAGC AGCAAAGGGCCTCAAT TCCT GT AAGAGACCCT CG CAGCCC AGAGGCGCCC AGAGGAAGTC A 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
C AGGGGT ATT GC TAAGCC AAGAAGGAACTT TCCC AC ACT ACT GAATGGAAGC AGGCT GTCTT GT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GC CT ACT AGAGC AAGAAACC AGTT GT AATATAAAAT GCACTGCC CT ACT GTT GGT ATGACT ACCGT T ACCT 
AC TGTT GT CAT T GTT ATT AC AGCT AT GGCC ACT AT TAT T AAAGAGCTGT GTAAC AT CT CTGGC AAAAAAAA 
AAAA 
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FIGURE 112 



MLQDPDSDQPLNS LDVKPLRKPRI PMET FRKVGI P 1 1 1 ALLS LAS 1 1 1 VWL IKVI LDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVBWKVRAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWG I VSWGYGCGGPS TPGVYTKVSAYLNWI YNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
T AGGTGTAAGCCACCGT GTCTGGCC T CT GAACAACT T TT T CAGCAACTAA&AAAGCCACAGGAGT 
T GAACT GC T AGGAT T CT GAC T ATGC TGTGGTGGC TAG T GC TC C T AC T CC T ACC T ACAT TAAAAT C 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTC T GC T TAGACCACCT GGT T TATGTGACAGGAC T TGCAT TC T CC TGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAAC T TGTGCCAAAT TAT GGGTCAGAAAAGAT G 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGC TGC TGAAGGGCAAC TGCAGGCCGAT GC TCTCATCAGCCAGGCAGCAGCCAAAA 
T CT GCGAT CACCAGCCAGGGGCAGCCGT CT GGGAAGGAGCAAGCAAAGTGACCAT T TC TCC TCCC 
CTCCT TCCCT C T GAGAGGCCC T CCTATGTCCC TACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAAT T TAAAT CATGT TC TAGT AAT TGGAGCT GTCCCCAAGACCAAAGGAGC TAGAGC TT GGT T 
C AAAT GAT C T C CAAGGGC CC T TAT AC C C CAGGAGAC T TT GAT T T GAAT T T GAAAC C C CAAAT CCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAA^TACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGC TAC T CGGGAGGCT GAGACAGGAGAAT TAC TTGAAC CTGGGAGGTGAAGGAGGC TGAGACA 
GGAGAAT CACTTCAGCCTGAGCAACACAGCGAGACTC TGT CTCAGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 
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MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 
amino acids 1-15 
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FIGURE 115 

CAGCAGTGGT C TCTCAG T CCT CTCAAAGCAAGGAAAGAGTAC T GTGT GCT GAGAGAC CATGG CAA 
AGAATCC T CCAGAGAAT T GTGAAGACTGT CACAT TC TAAATGCAGAAGCT T T TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCTACAGCAATGGAGAGAAGAAGAAGAT T TACATGGAAAT TGAT CCT GTGACCAGAACTGAAATA 
T TCAGAAGCGGAAATGGCAC TGAT GAAACAT TGGAAGTGCACGACT T T AAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAAT TACCACAACT TTCTTTGAACAGTCAGTG 
AT T TGGGT CCCAGCAGAAAAGCCTAT TGAAAACCGAGAT T T TCTT AAAAAT TCCAAAATTC T GGA 
GAT T TGTGAT AACGTGACCATGTAT T GGATCAATC CCAC TC TAATAT CAGTTTC TGAGTTACAAG 
ACT T TGAGGAGGAGGGAGAAGATCT T CACTT TCCT GCCAACGAAAAAAAAGGGATTGAACAAAAT 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAACT TCCAATAAATGACTATAC TGAAAAT GGAATAGAAT T T GAT CCCATGCT GGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTC TCCAGAAT TACT TGTAGGTAATT CC TC T C T TCAT GT T C TAATAAAC T T CTACA 
T TAT CACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



l^IAKNPPENCEDCHILNMAFKSKKICKSLKICGLVFGILALTLIVLF^GSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRYCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminaJL targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCC AGGATC ATG TCCACCACC ACATGCCAAGTGGT GGCGTT CCT CCTGT CC AT CC TGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
T ATTTC ACC ATCCT GGGACT TCCAGC CAT GCTGC AGGCAGTGCGAGCCCTGAT GATCGTAGGCATCGT CCT 
GGGTGCCAT TGGCCTCCT GGTATCCATCTTTGCCCT GAAATGCATCCGCATT GGCAGC ATGGAGGACT CT G 
CC AAAGCC AACATGAC ACTGACCTCCGGGATC AT GTTCAT T G TCTCAGGT CT TTGTGC AATT GCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
TGGGT CCAAC AC CAAAAACAAGAAG ATAT ACG AT GGAGGT GCCCGCAC AGAGGACGAGGTACAAT CTT AT C 
CTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTGAGTT ATTT ATGAAT T AGAGGCT AT AGCTCACAT T TTC AAT CC TCTATT T CTT TT TTTAAAT ATAAC T 
TT CT AC TCTGAT GAGAGAAT GT GGTT TT AAT CTC TCTCTC AC AT TT TGATGATTT AGACAGACT CCCC CT C 
TT CCTCCT AGTC AATAAACC CATT GATGAT CT AT TTCCCAGCTT AT CCCC AAGAAAACTTTT GAAAGGAAA 
GAGT AGACCC AAAGATGTT AT T TT CT GCT GTTTGAATT TT GTCT CCCCAC CCCC AACTTGGCTAGT AAT AA 
AC ACTTACTGAAGAAGAAGC AATAAGAG AAAGAT AT TT GT AATC TCTCCAGCCC ATGATCTC GGTT TT CT T 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTC AGAAATTGTC CC TAGATG AAT GAGAAAATT AT TTTT TTTAAT TT AAGTCCTAAAT ATAGTT AA 
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG 
GAAATGAAAAAAT AAT T G CT TT GACATT GTCT AT ATGGTACTTT GT AAAGTCATGCTT AAGT ACAAAT TC C 
AT GAAAAGCT C ACACC TGTAATCC T AGC ACTT T GGG AGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGT 
TC GAGACT AGCCT GGG CAAC AT GGAG AAGCCC TGTCT C TAC AAAAT ACAGAG AGAAAAAATCAGCCAGTC A 
TGGT GGCAT ACAC CTGTAGT CCC AGC AT T CCGGGAGGCTGAGGT GGGAGG ATCACTTGAGCCCAGGGAGGT 
TGGGGC TGCAGT GAGC CATGAT CACACC AC TGCACTCCAGCC AGGT GACATAGCGAGATCCT GTCTAAAAA 
AAT AAAAAAT AAAT AATGGAAC ACAGCAAGTCCT AGGAAGT AGG TT AAAACT AATTCT TT AA 
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FIGURE 118 



MS TT TCQWAFLLS I LGLAGC I AATGMDMWSTQDL YDNPVTS VFQYEGLWRSCVRQS S GFTECRP 
YFTILGLPi^LQAVRAIMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKAM4TLTSGIMFIVSGL 
CAI AGVS VFANMLVTNFWMS TANMYTGMGGMVQT VQTRYT FGAAL FVGWVAGGLTL I GGVMMC IA 

CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGMTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



GGAAAAAC TGTTCTCTTCTG T GGC ACAGAGAAC CC T GCT T CAAAGCAGAAG TAGC AGT T CC GGAGTC C 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGT GAAGGCTCACAT TCTGCT GACGGCTGGAATCATCTT CAT 
IM, CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
O ACTCAATAGTGAATGTTGCC CAAAAACGT GAGCTT GGAGAAGCTCTCTACT TAGGAT GGACCACGGCA 

O CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
fl CAGATACTCGATACC TT CCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 

TCTACTCCAGAAGTCAGTATGTG TAG TTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
f p ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTT CTTAACTGCCT 

%j AAT CT TAATTACAGGAACTGT GCATCAGCTAT TTAT GAT TCTATAAGCTAT TTCAGCAGAATGAGATA 

% TTAAACCCAATGCTTTGATTGTT CTAGAAAGTATAGTAATT TGT TTTCTAAGGTGGT TCAAGCAT CTA 

W CTCTT TTTATCATTTACTTCAAAATGACATT GCTAAAGACTGCAT TATTTTACTACT GTAATT TCTCC 

III 

!;f ACGACATAGCATTATGTACATAGATGAGT GTAACATTTATATCTCACATAGAGACAT GCTTATAT GGT 

|y TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 
r4 ATCATGGATAGGGTT GAAGAAGGT TACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 

fU TTTATAAT GAAGATT AAAAT GAAGGCTTT AAT CAGCATTGTAAAGGAAATT GAATGGCTTTCT GATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACT CAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGT G 
AAAAT ATT TTTGTTT TTGTAT TT GAAGAAGAATGATGCATT TTGACAAGAAATCATATAT GTATGGAT 
AT ATT TTAAT AAGTAT T T GAGTACAGAC T TT GAGGT T T CAT CAAT ATAAATAAAAGAGCAGAAAAAT A 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACATT TATAT TAATAAATTGTACATT TTT CTAAT T 
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FIGURE 120 



MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIElSnsriWFENFTOGLWMNCVRQimiRMQCK 
IYDSLLALSPDLQAARGLMCAASVMS FIJAF^MAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG 
MWL I PVS WVANAI I RDFYNS I VNVAQKRELGEAL YLGWT TALVL I VGGAL FCCVFCCNEKS SSY 
RYS I PSHRTTQKS YHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCC ATGC GACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT TCAAAGGAGAAAAGGGGGAAT GTCTGAGGGAAAGC T T TGAGGAGTCC T GGACACCCAACTAC 
AAGCAGTG TT CATGGAGT T CAT T GAAT TATGGCATAGATC TTGGGAAAAT T GCGGAGTGTACAT T 
TACAAAGAT GCGT T CAAATAGTGC T CTAAGAGT TT TGT TCAGT GGC TCACT T CGGC TAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
AT TGAAGCTATAAT T TAT T TGGACCAAGGAAGCCC TGAAATGAAT T CAACAAT TAATAT TCATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
AT TATT GAAGAAC TACCAAA ATAAA TGC TT TAAT T T TCATTTGCTACCT CT T T TT T TAT TATGCC 
TT GGAATGGTTCACT TAAATGACATTTTAAATAAGT TTATGTATACATCTGAATGAAAAGCAAAG 
CTAAATAT GT T TACAGACCAAAGT GTGAT T T CACAC TGT T TT TAAAT CTAGCAT TAT TCAT T TT G 
CT T CAATCAAAAGTGGT T TCAATAT TTT TTTTAGTTGGT TAGAATAC TT T CTTCATAGTCACAT T 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGC TACCAAT CT T TGTACAAT T TG TAAATGTTAAGAAT TTT T TT TATAT C TGT 
TAAAT AAAAATTAT TT CCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
S VEGLCE G I GAGLVDVAI WVGTCS DYPKGDAS TGWNS VS R 1 1 1 EE L PK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT TGCCTGGAAGAATACATCATGTT TT TCGATAAGAAGAAATTGTAGGAT CCAGTT TTTTT TTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGGATGGGTTTCAATG 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
AT CT CAGAAAT TACAGGAGATACCCTCAAGTATATCTGC TGGT TGC T TAGGT T TGT CCCT T CGC T 
Q ATAACAGCCT T CAAAAAC T TAAGTATAATCAATT TAAAGGGCT CAACCAGCT CACCTGGC TATAC 

S. CT T GACCATAACCATATCAGCAATATT GACGAAAAT GCT T TTAAT GGAATACGCAGAC TCAAAGA 

GC T GAT T CT TAGTTCCAATAGAAT C TCCTATT T TCT TAACAATACC T TCAGACCTGTGACAAATT 
W TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
M CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
If! AGACT GCCGCAACC T GGAAC TT TTGGACCT GGGATATAACCGGAT CCGAAGTT TAGCCAGGAAT G 

p| TCT T T GCTGGCATGATCAGACT CAAAGAACT TCACC TGGAGCACAAT CAATT T TCCAAGC TCAAC 

t*l CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCT TGAT TTATCAGGCAATGAGA 
3 TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
Q T CCAACAAGCT CACAT T TAT TGGT CAAGAGAT T TTGGAT TCT TGGATAT CCC TCAATGACATCAG 

m TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
j£* TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
W AT CGAT GCAGT GAAGAAC TACAGCATCT GTGGCAAAAGTAC TACAGAGAGGT T TGATCTGGCCAG 

Til GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
Q TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
p;; TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGT CCC TAAAGCAAATGACT CCCAGCACCCAGGAAT TT TATGTA 
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA 
TAACAAATCGGGCT CCAGGGAGTGTGAGGTATGAACCAT TGT GAT AAAAAGAGCTC T TAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CAT T TT CC TC TCATACATAAT CAACCCAT TGAAAT T TAAATACCACAAT CAATGTGAAGC T T 
GAACT C CGGTT TAATATAATACCTATT GTATAAGACCC T T TACT GAT T CCAT TAAT GTCGCAT T T 
G T T T T AAG AT AAAAC T T C T T T C AT AGG T AAAAAAAAAAA 
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FIGURE 124 

MGFNVIRLLS GSAVALVI APT VLLTMLS S AERGC PKGCRCEGKMVYCESQKLQE I PS S I SAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
IARWFAGMIRLKELHLEHNQFSKLNIALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGVNVI DAVKNYS I CGKS T TERFDLARALPKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHI S FHKI IAGS VALFLSVLVI LLVI YVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 12s 



CCGTTATCGTCTTGCGCTACTGCTGAATCTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGT T GGGAGACGGTGCAAGAGAAT CTGCCCCC TATAGGGGAAT GGT GCGCACAGCCC TAGGGAT C 
AT T GAAGAGGAAGGCT T TCTAAAGCT T TGGCAAGGAGT GACACCCGCCATT TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
TTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 
AAAACCAT TGCGAT T TCGTGGT GTACAT CATGCAT T TGCAAAAAT CT TAGCTGAAGGAGGAATAC 
GAGGGCT T TGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACT GGT GAATATGGGAGATT TA 
ACCACT TAT GATACAGTGAAACACTACT T GGTAT TGAATACACCACT TGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACT T T T GTATAAAT CAT CG 
ACT GACT GCT TGAT TCAGGCT GT T CAAGGTGAAGGATT CATGAGTCTATATAAAGGC TT T T TACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
T GAG T G GAG T C AG T C CAT T T TAA 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VTGGMMAGVI GQFLANPTDLVKVQMQME GKRKLEGKPLRFRGVHHAFAKI LAEGGI RGLWAGWVP 
NIQRAAIiVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 



Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCC TC T CCAT GT TC TCAGC TATCCGTTCTCAGCACAGCGGTGTAGACAT C 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAAT GTGGACGATGGGCACATCAT TAACATCAATAGCAT GTCT GGCCACCGAGTGT TA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAAT T CGCCT TCAAACTCCACGACAAGGACCC TGAGAAGGCAGCT GCCACC TATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CAT CCAGAT TGGAGACAT CCAGAT GAGGCCCACGGAGCAGGTGACC TAG TGAC TGTGGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCAC T TCC T GTCCACACCCCGACCAGGGGC TAGAAAAT T T GT T TGAGAT T T TTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCT GGCCCAGT GGATT TCATGGTGAT CAT TAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKW 

PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFTSTVNV 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 

ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACTTCTAC ATGG GCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAAT TT TTAACTGATAGTTGTACATAT TTGGGGGTACATGTGATATTTGGATACATGTATACAA 
TATATAATGATCAAATCAGGGTAACTGGGATATCCATCACATCAAACATTTATT TT TTATTCTTT 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAAT GCCCAAC TAAT T T T TGTAT T T T TAGTAGAGACGGGGT T T T GCCATGT TGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
TTCCATCCAT GTTGCTGCAAATGACAGGATT TCGTTCT TAATTTCAATTAAAATAACCACACAT G 
GCAAAAA 
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FIGURE i5*o 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature* 

amino acids 68-79 
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FIGURE 131 



TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACCATGATCAGCGCAGCCTGGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
T CCAT CTGTGTGTCGC TGCGAT GCGG GTTTCATT TACTGT AATGAT CGCT TTCT GACAT C CATT CC AAC AG 
GAAT AC C AGAGGAT GC TACAACTCTC TACCTT C AGAAC AACC AAAT AAAT AATGCT GGGATT CCTT C AGAT 
TT GAAAAACT TGCT GAAAGT AGAAAGAAT AT ACCT ATACC ACAACAGTTT AGATGAAT TTCCTACC AACCT 
CC CAAAGT AT GT AAAAGAGTT ACAT T TGCAAGAAAATAAC AT AAGGACT ATC ACTT AT GAT T C ACT T T CAA 
AAAT T C CCT ATCTGGAAGAATT ACAT TTAGAT GAC AACTCTGT C TCTGC AGT T AGC AT AGAAGAGGGAGC A 
T T CCGAGAC AGCAACT AT CT CCGACT GCT TTT CCT GTCCCGT AATCACCT T AGC ACAATTCCCT GGGGTT T 
GCCC AGGACT AT AGAAGAAC TACGCTTGGATGAT AAT CGC AT AT CC ACTATTTC AT CACC AT CTCT TC AAG 
GTCT CACT AGTC TAAAACGC CT GGTT CT AGAT GGAAACCT GTTGAACAAT CAT GGT TTAGGT GACAAAGT T 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCTGAGGAAGCT TT ATCTTC AAGAT AAC CACATCAATC GGGTGCCCCC AAAT GCT TT T T 
CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
GATGAT TT GGAC AAT ATAAC ACAACT GATTCT TC GCAACAATCCCT GGT ATTGCGGGT GC AAGAT GAAAT G 
GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
AGGTTCGT GGGATGGC TATT AAGGAT CTCAAT GC AGAACT GTTT GATTGT AAGGACAGTGGGAT TGT AAGC 
AC CATT CAGATAACCACT GC AAT ACC CAAC ACAGTGTATCCTGCCC AAGGAC AGTGGCC AGC TCCAGTGAC 
CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 
CAATTAC AATTACT GT GAAGTCTGT C ACCT CTGAT ACCATTCATAT CTCTTGGAAACT TGCT CT ACCT AT G 
ACTGCT TTGAGACT CAGC TGGCTT AAACTGGGCC AT AGCC CGGC ATTTGGAT CT AT AACAGAAACAAT TGT 
AAC AGGGG AACGCAGTGAGT ACTTGGT C ACAGCC CTGGAGCCT GAT TCACCCT ATAAAGT ATGC ATGGTT C 
CC ATGGAAACCAGCAACCTCTACCT AT TTG AT GAAACT CCTGT T TGTATT GAGACTGAAACTGC ACCC CT T 
CGAAT GTACAAC CCT ACAAC CACCCT CAAT CGAGAGCAAGAGAAAGAACCTT ACAAAAACCCCAATT T ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT CAT AGGAATGGATCGCT CTT CT CAAGGAACTGTGCAT AT AGC AAAGGG AGGAGAAGAAAGGATGACTAT 
GCAGAAGCTGGCACTAAG AAGGAC AACT CTATCC TGGAAATCAGGGAAAC TT CTTTTCAGATGTTACCAAT 
AAGCAATG AACCC ATCTCGAAGGAGGAGT T TGT AAT AC ACACC ATATTTCCT CCTAATGGAATGAAT CTGT 
ACAAAAAC AATC AC AGTG AAAGCAGT AGT AACCGAAGC TAC AGAGACAGT GGT ATT CCAGACTC AGAT CAC 
TC ACACTC ATGAT G CT GAAGGACT CACAGC AG ACT T GT GT TT TGGGTTTT TT AAACCT AAGGGAGGTGAT G 
GT 
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FIGURE 132 



MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQimAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVS IEEGAFRDSNYLRLLFLSRNHLST I PWGLPRT IEELRLDDNRI ST IS S PSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSN1MLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLN^ 

IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI IGGAVALVT IALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 
amino acids 1-28 
Transmembrane domain: 
W amino acids 531-552 

N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 



m 
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FIGURE 133 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACT CCAACAGAGCCAGGACTATATCAACCTC T TC TGCGCCAACAT GAT GGAC T TGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGC TTCGGGGAGCCTGATGC TGAAGATGAAGAATTATCTAAAGCTAT TCAA 
TAT CAGCAGCAT T T T TCGAGGAGAG TGAAGAGGCGAGAAAAACAAT T T CCAGAT TCTCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATT ATAG GTGTGAGCCACCGTGTCTG 
GCT GAAAAGCAC T T T CAAAGAGAC TGT GTT GAATAAAGGGC CAAGGTT C T TGC CACCCAGCAC TC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGAC CCAGGT GTGAACGGATGAATAAAGT TCAACT GCAACTGAAAAAAAAAAA 
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FIGURE 134 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWAFLMRGCTQGPLQQSQD 
YINLFCAM^DLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE las 

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGATTTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAAT TACATCAAAACATCAGAGGT TGTCAGAC TGCCC TAT CC TCTCCAAATGAAATCT TCAGGT 
H CCACCT T CT TACT TTAT TAAAAGGGAATCGT GGGGC TGGACAGACTT TC TAATGAACCCAATGGT 

*J TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
^ CT GACATGAGACGGGAAATGGAGCAGT CAAT GAATATGC TGAAT TCCAACCATGAGT TGCC TGAT 

{(I GT T TCTGAGT TCATGACAAGACTCTTC T CT T CAAAATCATC TGGCAAAT CTAGCAGCGGCAGCAG 

If! TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
CP AAACACGGCAACAC TGGGTGGCAT CCAAGT CT TGGAAAACCGT GT GAAGCAACTAC TATAAACT T 

GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 
f*t AAC TT T T TAGCACATGT T T TGTAC T TGGTACACGAGAAAACCCAGC T T T CAT C T TT TGT C TGTAT 

HI GAGGT CAATAT TGATGTCAC TGAAT TAAT TACAGTGT CC TATAGAAAAT GCCATTAATAAAT TAT 

Q AT GAACTACTATACATTATGTATAT TAATTAAAACATCT TAATCCAGAAATCAAAAAAAAAAAAA 

III AAAAAAAAAAAAAAA 

5 



APP_ID=1 0063567 



Page 287 of 320 



FIGURE 1*6 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHDIPSGSYWEWS PAYRFDPVRVDI TSKGKMRARYVNYIKTSE 
WRLPYPLQMKS SGPPS YFIKRES WGWT D FIiMNPMVMMMVLPLL I FVLLPKWNTS DPDMRREME 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTAT ATGC GTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
H AAGCT GT GTGATCGCCACAAACC T TCAGGAAATACGAAATGGAT T T TCT GAGATACGGGGCAGT G 

U TGCAAGCCAAAGAT GGAAACAT TGACAT CAGAATC T TAAGGAGGACT GAGTCT T TGCAAGACACA 

K AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
8*1 1 AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
|f| CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
IP AAGAAATACAGCCAGAT TC TGAGT CACT T T GAAAAGCTGGAACCT CAGGCAGCAGT TGTGAAGGC 

^ T T T GGGGGAAC TAGACATTC T TCT GCAAT GGATGGAGGAGACAGA ATAG GAGGAAAGTGATGC T G 

1.. CT GCTAAGAATAT TCGAGGTCAAGAGC T CCAGTC T T CAATACCTGCAGAGGAGGCATGACCCCAA 

m ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
Q CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
1 1| TGTAATATCTT TC TGC TATTGGATATAT T TAT TAGT TAATATATT TAT T TATT TT TTGCTATT TA 

W ATGTAT T TAT T TT TT TAC TTGGACATGAAACT T TAAAAAAATT CACAGAT T ATAT TTATAACC T G 

ffl 

ACTAGAGCAGGTGATGTATT T T TATACAGTAAAAAAAAAAAACCT TGTAAAT TC TAGAAGAGTGG 
CTAGGGGGGT TAT TCAT T TGTAT TCAACT AAGGACATAT TTAC TCAT GC TGATGC TC TGTGAGAT 
AT T T GAAAT T GAAC CAAT GAC TAC T TAG GAT G GG T T GT GGAAT AAG T T T T GAT G T GGAAT T GCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 

SIANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQ 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCT CTCT TCAG TAAAGTT GT TAT T GT TC TGATAGATGCCT T GAGAGATGAT TTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTCTTT TATGGAGATGAAACCTGGGT TAAATTA 
TT CCCAAAGCATT T TGTGGAATAT GATGGAACAACC TCAT TTT T CGTGT CAGAT TACACAGAGGT 
GGATAATAAT GTCACGAGGCAT TT GGATAAAGTAT TAAAAAGAGGAGAT T GGGACATAT TAATCC 
TC CACTACCT GGGGC TGGACCACAT TGGCCACAT T TCAGGGCCCAACAGCCCCC TGATTGGGCAG 
AAGC TGAGCGAGATGGACAGCGT GCTGAT GAAGAT CCACACCTCACTGCAGT CGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
ACT T GGCT TACCGAT TCCAAAAGACAGTGTAGGGAGCC TCCTAT T C CCAG TT GT GGAAGGAAGAC 
CAATGAGAGAGCAGTTGAGATTTTTACATTTGAATACAGTGCAGCT TAGTAAACTGT TGCAAGAG 
AAT GT GCCGT CATATGAAAAAGATCCT GGGT T TGAGCAGTT TAAAAT GT CAGAAAGATTGCAT GG 
GAACT GGAT CAGAC TGTACTT GGAGGAAAAGCAT TCAGAAGTCCTAT TCAACCTGGGCTCCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
CAAG GT GAT T G TAAAGAGC T GGC GG T CACAGAGGAACAAGC C C CC C AGC T GAGG GG G TG T GTGAA 
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
AC T CAGATC CACAGAGCCCAGGAT CAAGGGACCCAC T GCAGTGGCAGCAGGACTGT TGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C T GACCGAGACAC TCACAGCT TT GT CATCAGGGCACAGGCT TC CT CGGAGCCAGGAT GAT CT G T G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C T G CACAC AG TAT G TAG T T AC CAAAAGAAT AAAC GGCAATAAT T GAGAAAAAAAA 
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FIGURE 14Q 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKraPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSE TGSHGAS S TEEVNTPL I L I S SAFERKPGDI RHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE iai 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAAC TGCCAGCACCT TAAGACCAC TCACACCT T CAGAGTGAAGAACT TAAAC 
CCGAAGAAAT TCAGCAT T CATGACCAGGATCACAAAGTAC TGGTCCT GGACT C TGGGAATC TCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGT GACAAGGATAAAGGACAAAGT CAT CCAT CCC T TCAGC TGAAGAAGGAGAAACTGATGAAGCT 
GGC T GC C CAAAAGGAAT CAGCACG C CG G CCC T T CAT C T T T TAT AGGGC T CAG G T GGGCT C C T GGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAAT TTTCAT TTCAACCAGTTTGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT T TGAACTAAT T GTATAAAAACACCAAACCTGC TCAC T 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASS LS S AS AEKGS P I LLGVSKGE FCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFI CTS CNCNEPVGVTDKFENRKH IE FS FQPVCKAEMS PSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 



APP_ID=1 0063567 



Page 294 



FIGURE 143 

CTAGAGAGTATAGGGCAGAAGGATGGCAGAT GAGT GAC TCCACAT CCAGAGCT GCC TCCC TTTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CT GGCAT TT CCACGAGCAAAGGGACTGT GAT GAACACAATGTCAT GGCT CGT TACC TCCC TGCCA 
CAGTGGAGT T TGC TGTCCACACAT T CAACCAACAGAGCAAGGACTACTATGCC TACAGAC TGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
GGGGAGAACTAGGTGTGGGAAATTTGAAGACGACAT TGACAACTGCCATT TCCAAGAAAGCACAG 
AGCTGAACAATACT TTCACCTGCT TCTTCACCATCAGCACCAGGCCCTGGATGACTCAGT TCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
T T GGACT TGT T TGTTAT CC TAT TT TGCAT GT GT TTGAGATCT CAGATCAGTGT TTTAGAAAATCC 
ACACATCTT GAGCC TAAT CATGT AGT GTAGATCAT TAAACATCAGCATT TTAAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE iaa 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 

CT GT GCAGCT CGAGGC TCCAGAGGCACACTCCAGAGAGAGCCAAGGTT CT GACGCG ATG AGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAAT CAAGT GGAACCGGAAGGCCC T GCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
\* CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
Jlf TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
m AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
|y CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
Wl TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACAT CAGGCAC TGCGCCACCTGC T TCACAGTACT T CCCAACAACTCT TAGAGGTAG 
jp GTGTAT TCCCGTT T TACAGATAAGGAAACTGAGGCCCAGAGAGCT GAAGTACT GCACCCAGCAT C 

Ifl ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
U GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
yf CTCTAAT GAAAT TGTGAAAGCTCCATGT T TAGAAATAAATGAAAACACCTGA 

5 
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FIGURE 1*6 

MRKHLSWWWIATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLIALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCT GAT GAT T TATAGAC TCAAAGAAAACTCATGTT CAGAAGCTCT CT T CT CT TCT GGCCTCCT C T 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
C7\ACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 



APPJEN10063567 



Page 299 



FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 



H 
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Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE iaq 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAATTAAGT GCAGAAAGACCTT T GAAT GAACAGATTGCT GAA 
GCAGAAGAAGACAAGAT TAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTAT TC TT T 
TGT TGATAACTT GAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAAT T GAGAAAGAAAGACAATCTA 
TAAGAAGCT CCCCACTTGATAATAAGTT GAAT GTGGAAGATGTTGATT CAACCAAGAATCGAAAACTG 
ATCGATGATTATGACTCTACTAAGAGTGGATT GGAT CATAAATTT CAAGAT GATCCAGAT GGTCT TCA 
TCAACTAGACGGGACTCCT T TAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 
AAAAT GACAGAGCCGTGTT T GACAAGATTGTT TCTAAACTACT TAATCTCGGCCTTATCACAGAAAGC 
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA 
TTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCAGAGAAAG 
TGACTCCAATGGCAGCAATT CAAGATGGT CTT GCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
TTAACCTT GACAAAT GGCTT GGAAAGGAGAACTAAAACCTACAGTGAAGACAACTT T GAGGAACT CCA 
ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACTGATTACTATCATGAAAACACTGATT GACTTT GTGAAGATGATGGTGAAATATGGAACAATATCT 
CCAGAAGAAGGTGTT TCCTACCT TGAAAACTT GGATGAAATGATT GCTCTT CAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAGACAGTACCAAGGAAGAAGCAGC TAAGAT GGAAAAGGAATATGGAAGCT TGAAGGATT CCACAAAA 
GAT GATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCC TAT TTGGAAGC 
CATCAGAAAAAATAT TGAAT GGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTT TCAGAAAACATAATATAGCT TAAAACACTTCTAATT CTGT GATTAAAATT TTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS I RS S PLDNKLNVE DVDS TKNRKL I DD YDS TKS GLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANn^YEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKS IDSEKEAKEKETLI T IMKTLIDFVKMMVKYGTI S PEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 



APP_ID=10063567 



Page 302 



FIGURE isi 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
G ATG GTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCT TC TAGCTGGAGGGCT GCATGCAGGGAAGGT CAT TAAAGGTGAAGAGAT CAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGT CAGACT CACCCAGCT TCCCGAGAAT GGT GGC T GGAAT GCCCCCATCACAGACT T C TACT TCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGT GGAGGAGACCCATGGCGGACAAT CACT C TC TC TGCTC TCAGGACCCCCACGTCTGACT TAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
^ CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
\ll CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
■W TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
01 GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
hi GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
jfj TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAATGGCT CGAGC TCAGAAGATAAAAGATAAGTAGGGTATGC TGAT CCT C TT TTAAAAACCCAA 
y J GATACAATCAAAATCCCAGAT GCT GGT CTCTAT TCCCATGAAAAAGT GCT CAT GACATAT TGAGA 

\f AGACCTACTTACAAAGTGGCATATATTGCAAT TTATTTTAATTAAAAGATACCTATT TATATAT T 

« TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
fF| AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
AT T CCT T GTATAAAAATAAGAAAAGAAAT TAATCT TGAGG TAAGCAGAGCAGACATCATCTCTGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
O TTGTAGTAGTGATCAGGAAACAGATCT CAGCAAAGCCACTGAGGAGGAGGCTGTGC TGAGT TTGT 

p| GT GGCT GGAATCT CTGGGTAAGGAACT TAAAGAACAAAAATCATCT GGTAAT TC TT TCCTAGAAG 

« GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
*** GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGATGAAGGCAGAGATCGGAGT T T TGCAGCCACAAGC TAAGAAACACCAAGGATT GTGGCAACC 
AT CAGAAGCTTGGAAGAGGCAAAGAAGAATTCT TCCCTAGAGGCTTTAGAGGGATAACGGCTC TG 
CTGAAACCT TAATC T CAGACT T CCAGCCT CC TGAACGAAGAAAGAATAAATT T CGGC T GT T TTAA 
GCCACCAAGGATAAT TGGTTACAGCAGC T CTAGGAAAC TAATACAGCTGC TAAAAT GATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAAT AT GGCAGAAGTGATGGCAT GCCACT TCCAAGATTAGGT TATAAAAGACAC TGCAGCT T C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCC CACAGCCACAT TAGT GAACCTAGAAGCAGAGAC TC TGTGAGATAAT CGAT GT T T GT TGTT T T 
AAGT TGCTCAGTT TTGGTCTAACT T GTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 1*2 

MVLSGALC FRMKDSALKVL YLHISfNQLLAGGLHAGKVIKGEE I S WPNRWLDASLS PVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAP I TDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAAT6GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT T GAAGGT GATGACC T GCATAT CCAGAGGAAT GTGCAAAAGCT GAAGGACACAGTGAAAAAGC 
TT GGAGAGAGTGGAGAGATCAAAGCAAT TGGAGAACTGGAT TT GC TGTT TAT GTCTC TGAGAAAT 
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAATTAGATGCCCCAAAGCGATTTT TT TTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
AT GGGTGGAT TCCAAAT GAACCCC T GCGT TAGT TACAAAGGAAACCAAT GCCAC TT T TGTT TATA 
AGACCAGAAGGTAGAC T T TC TAAGCATAGATAT TTAT TGATAACAT T T CATT GTAAC TGGTGTT C 
TATACACAGAAAACAAT T TAT T TT T TAAATAAT TGTCT T TT TCCATAAAAAAGATTACT T TCCAT 
TCCT TTAGGGGAAAAAACCCCTAAATAGCT T CATGT T TCCATAAT CAGTACT T TAT AT T TAT AAA 
TGTATT TAT TATTATTATAAGACTGCAT TTTATTTATATCATT TTATTAATATGGATTTAT TTAT 
AGAAAC AT CAT T C GAT AT T GC T AC T T GAG T G T AAGGC T AAT AT T GAT AT T TAT GAC AAT AAT TAT 
AGAGCTATAACATGTTTATTTGACCTCAATAAACACTTGGATATCCC 
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FIGURE isa 

MAALQKS VS S FLMGTLAT S CLLLLALLVQGGAAAP I S SHCRLDKSNFQQP Y I TNRT FMLAKEAS L 
ADNNTDVRL I GEKL FHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEVVP FLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE iss 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGT CAGTGCCCGACT TGTGAC T GAGT GTGCAGTGC CCAGCATGTACCAGGT CAGT GCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGAT TAGGT GAGGACAGT TCTC TCAT TAGCCT T T T CCTACAGGTGGT TGCAT 
TCT T GGCAAT GGTCATGGGAACCCACACC TACAGCCACT GGCCCAGC TGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCAC T TTCTAGATATTT CCCCCT T GCTGGAGAAGAAAGAGCCCCTGGTT T TAT T 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T TGACATGGATGAT TCTGAGGAGGAAGCT GT TAT T GAAT GTATAGAGAT T TAT CCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE is6 

MRERPRLGEDSSLISLFLQWAFL7MVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGACTATTCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCT T CCAGAC TCAGACCAGACCCTC TGGT GGTAAATGGACATT TTCC TACATCGGC T TCCC TGTA 
GAGCT GAACACAGTCTATTT CAT T GGGGCCCATAATAT T CCTAAT GCAAATATGAAT GAAGATGG 
H CCC T T C CAT G T C T G T GAAT T TCAC C T C AC CAGGC T GCC T AGAC CACATAAT GAAAT AT AAAAAAA 

W AGT GT GT CAAGGCCGGAAGCCT GT GGGAT CCGAACATCAC TGCTT GTAAGAAGAATGAGGAGACA 

r| 

2;; GTAGAAGTGAACT TCACAACCACTCCCCTGGGAAACAGATACATGGCTCT TATCCAACACAGCAC 

a'«» TATCATCGGGT TT TCTCAGGTGTTT GAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 

Ifl TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
If! AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
vk TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
Q TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
fll AGT GGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCT T GCCACTCAAAAGAAGGCA 

GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATCTAAGAAGCCAGAT T CATCT GCACAAATACGTGGTGGT CTAC TT TAGAGAGAT TGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGT GCAGAACT TC TCCATGT CAAGCAGCAGGTGT CAGCAGGAAAAAGAT CACAAGCC TGCCAC G 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE is8 



S3 
W 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVETOFTTTPLGNRYMALIQH 
STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLWYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYWVYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 



Important features of the protein: 



Signal peptide: 

amino acids 1-14 

Transmembrane domain: 



'*.! amino acids 290-309 



N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE isq 

AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
AT CAT CAAT GAAAACCAGCGCGT T T CCATGT CACGTAACATCGAGAGCC GCTCCACC TCCCCC TG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAG TAAG AGGTGCATATCC 
ACTCAGCTGAAGAAG 
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FIGURE 160 



MTVKTLHGPAMVKYLLLS ILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGI INEN 
QRVSMS RNI E SRS T S PWNYTVTWDPNRYPSE WQAQCRNLGC INAQGKE D I SMNSVP I QQE TLW 
RRKHQGCSVS FQLEKVLVTVGCT CVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site, 

amino acids 83-87 



ill 
o 
rti 

a 

HI 



u 
q 
o 
m 

w 

If! 

01 



N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACT CCGT GCT GGAAGTAGGAGGAGAGT CAGGACTCCCAGG 
ACAGAGAGT GCACAAAC TACCCAGCACAGCC CCCT CCGCCCCCT CT GGAGGC TGAAGAGGGAT T C 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCT GAAGATGAGGAAAAGT T TGGAGGAGCAGC TGACT CAGGGGT GGAGGAGCC TAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
lml TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
!T TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
CI GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
O ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
ff| GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
Zl CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
Tl CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
Iff CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
Iff GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
%4 TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
;:f GT GGCT GTACTT CAC TACCCAGCAAAGCC T CCACGAGGGCAGC TC GCC T TGGAGAGTACT TAC TA 

|fl CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
Q CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
ii CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
Jjf CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
|fl CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
(If CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAA TAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQECETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSWYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENWVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSECASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence : 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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GGGAGGGCTCTGTGCCAGCCCCG ATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 

AAACAT CCTGACGT GGGACAGCGGGCCAGAGGGCAC CCCAGACACGGT CT ACAGCATCGAGTATA 

AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 

AACCT GACGGTGGAGACGGGCAACC TCACGGAGCTC TAC TAT GCCAGGGT CACCGC T 

GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 

CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

T TCTACCACT TAGAGC TCCAGGTCAACCGCACCTACCAAATGCACCT T GGAGGGAAGCAGAGAGA 

ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

T CCACAGCGGCATAGCC TGT CCGAGAT CACCTAC T TAGGGCAGCCAGACATCT CCATCCTCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

CCC TGGGGAT TTGCACAGACAGAACAT CTGACCCAAATGTGC TACACAGTGGGGAGGAAGGGACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCGATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 

GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

C AAGGCAGAAAT GACAG T G CAAG GAGGAAAT GC AGGGAAAC T C C C GAGG T C CAGAG C CC CAC C T C 

CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAATC T AGC TCGACAGAGCATGAGGCCCC TGCCTCT T C TGT CAT T GT TCAAAGGTGGGAAGAGA 

G C C T G G AAAAGAAC C AG GC C T G G AAAAG AAC C AG AAG G AG GC T G G GCAGAACCAGAACAACCTGC 

ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

T T CCCAGCCAGGGCAAC TGCCTGACGT T GCACGAT T TCAGC T TCAT TCCT CT GATAGAACAAAGC 

GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCT TTTCTGCAGGCAGGAGTT TCAGACCCT 

AT CCT GAGAATGGGGT T TGAAAGGAAGGTGAGGGC TGT GGCC CC T GGACGGGTACAATAACACAC 

TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 

CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 

ATC TGTAATGTGGGGAT CATAACACC TACCT CATGGAGT TG T GGT GAAGAT GAAATGAAG TCAT G 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQWRTYQMHLGGKQREYEFFGLTPDT 
E FLGT IMI CVPTWAKE SAP YMCRVKT LPDRT WT YS FS GAFL FSMG FLVAVLC YL S YRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGT GACGATGTATATGT CTACTACAAAC TGGCAGGGGGAT CCC TTGAAC T TTGGGC TGGA 
AGTGT T GAACACAGT T T TGGATATT T T CCAAAAGATT TGATCAAGGTACTTCATAAATACACGGA 
AGAAGAGCTACATATTCCAGCAGATGAGACAGACTTTGTCTGCTTTGAAGGAGGAAGAGATGATT 
T TAATAGTTATAATGTAGAAGAGCTT T TAGGAT C T T TGGAACTGGAGGACT CTGTACC TGAAGAG 
T CGAAGAAAGCT GAAGAAGT TT CT CAGCACAGAGAGAAATC TCCT GAGGAGT CTCGGGGGCGT GA 
ACTT GACCCTGTGCC TGAGCCCGAGGCAT T CAGAGCTGATT CAGAGGAT GGAGAAGGT GC TT TCT 
CAGAGAGCACCGAGGGGCT GCAGGGACAGCCCT CAGC TCAGGAGAGCCACCC TCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
AT TGAAAGTGCCGGGAAGCGAAAGCAGAACT GGCAATAGT TCTCCT GCC TCGGTGGAGCGGGAGA 
AGACAGATGCT TACAAAGTCCTGAAAACAGAAATGAGT CAGAGAGGAAG T GGACAGTGCGT TATT 
CAT TACAGCAAAGGAT T TCGT TGGCAT CAAAATC TAAGT TT GT T T TACAAAGATT GT T T T TAGTA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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MAAAPGLLFWLFVLGMWWPGQSDL^ 

FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYECVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-7 6 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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CCAGGACCAGGGCGCACCGGCTCAGCCTCT CACT TGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCT T GAGAATGAAGCAGAACAGAAGT TAAT AGAGAGCAT GTTGCAAAACCT GACAAAACCCGGG 
, L ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
I*, TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
U ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
Q CT TGGGGGT CCC TACC T TTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATAT T T G 

ffl CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
£a GAGACACCCATCAGAATGT GGTT GT TAC TGAAGCAGGTATAAT T CCCAAT CTAAT T TATG TTGTT 

Jjf! ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
•^J GCATAAAAGTAAAGGAAGAACAAAAACTAGT CCAAACCAGT CTACACT GT GGATT T CAAAGAGTA 

111 CCAGAAAAGAAAGTGGCATGGAAGT ATAAT AACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
\J GGATCT GTATAAGGAATGGCATCAGAACAAT AGC T TGGAATGGCTT GAAAT CACAAAGGATC TGC 

AAGAT GAACT GTAAGCTCCCCCT T GAGGCAAATAT TAAAGTAAT TT TTATAT GTC TATTATT TCA 
J w TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
Jf ACT TCAAACT TCAAGCAAATGAAATGGACAATGCAGATAAAGTTGT TATCAACACGTCGGGAGTA 

111 TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
q TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
p| ATGAACTGT T C TAATAT TTAT T T T TATGGCATCTCAT T TT T CAATACAT GCT C T T T TGAT TAAAG 

1% AAAC T TAT TAC T G T T G T CAAC T G AAT T C AC AC AC AC AC AAAT AT AG TAC C AT AGAAAAAG T T T GT 

l*i TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
III AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCC TCAAACAT TTTACTTAGAGGCAAGGAT 

TGT CTAAT T TCAAT TGTGCAAGACATGT GCCT TATAAT TATT TT TAGCT TAAAAT TAAACAGAT T 
TTGTAATAATGTAACTT TGT TAATAGGT GCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
T GACATACACAATATAAAT CATATGTCT TCACACGT TGCCTATATAAT GAGAAGCAGCTC TC TGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
AC TAT AT TAGTATACAAAGAGGT CATGT GGT TGAGACCAGGTGAATAGT CACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACAT T T TAGGAAGGAAAGGAACTACGAAATCGT GT GAAAATGGGT T GG 
AACCCATCAGTGATCGCATAT TCAT TGATGAGGGT T TGCT T GAGATAGAAAATGG TGGC TCC TT T 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



APP ID-10063567 



Page 319 of 320 
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MSRWSLLLGAALLCGHGAFCRRVVSGQKVCFADFKHPCYKMAYFHELSSRVS FQEARLACESE 
GGVLLSLENEAEQKL I E SMLQNLTKPGTGI S DGDFWI GLWRNGDGQTS GACPDLYQWS DGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPT7WPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNYWTEAGI I PNL I YWI PT I PLLLL ILVAFGTCC FQMLHKSKGRTKTS PNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 



Q 



Transmembrane domain: 



amino acids 214-235 




N-glycosylation sites. 

amino acids 86-89, 255-258 



amino acids 2 66-2 69 



hi 



N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 



145, 212-217 
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